METHODS OF DIAGNOSIS OF PROSTATE CANCER, 
COMPOSITIONS AND METHODS OF SCREENING FOR 
MODULATORS OF PROSTATE CANCER 

CROSS-REFERENCES TO RELATED APPLICATIONS 
This application claims priority from the following applications: USSN 
09/687,576, filed October 13, 2000; USSN 60/276,791, filed March 16, 2001; USSN 
60/288,589, filed May 4, 2001; USSN 09/733,742, filed December 8, 2000; USSN 
09/733,288, filed December 8, 2000; USSN 09/847,046, filed April 30, 2001; USSN 
60/276,888, filed March 16, 2001; USSN 60/286,214, filed April 24, 2001; USSN 
60/281,922, filed April 6, 2001; USSN 60/263,957, filed January 24, 2001, which are 
incorporated herein by reference in their entirety. 

FIELD OF THE INVENTION 

The invention relates to the identification of nucleic acid and protein 
expression profiles and nucleic acids, products, and antibodies thereto that are involved in 
prostate cancer; and to the use of such expression profiles and compositions in the diagnosis, 
prognosis and therapy of prostate cancer. The invention further relates to methods for 
identifying and using agents and/or targets that inhibit prostate cancer. 

BACKGROUND OF THE INVENTION 
Prostate cancer is the most commonly diagnosed internal malignancy and 
second most common cause of cancer death in men in the U.S., resulting in approximately 
40,000 deaths each year (Landis et al., CA Cancer J. Clin. 48:6-29 (1998); Greenlee et al., 
CA Cancer J. Clin. 50(1):7-13 (2000)), and incidence of prostate cancer has been increasing 
rapidly over the past 20 years in many parts of the world (Nakata et al., Int. J. Urol. 
7(7):254-257 (2000); Majeed et al., BJU Int. 85(9): 1058-1062 (2000)). It develops as the 
result of a pathologic transformation of normal prostate cells. In tumorigenesis, the cancer 
cell undergoes initiation, proliferation and loss of contact inhibition, culminating in invasion 
of surrounding tissue and, ultimately, metastasis. 

Deaths from prostate cancer are a result of metastasis of a prostate tumor. 
Therefore, early detection of the development of prostate cancer is critical in reducing 
mortality from this disease. Measuring levels of prostate-specific antigen (PSA) has become 
a very common method for early detection and screening, and may have contributed to the 
slight decrease in the mortality rate from prostate cancer in recent years (Nowroozi et al., 
Cancer Control 5(6):522-531 (1998)). However, many cases are not diagnosed until the 
disease has progressed to an advanced stage. 

Treatments such as surgery (prostatectomy), radiation therapy, and 
cryotherapy are potentially curative when the cancer remains localized to the prostate. 
Therefore, early detection of prostate cancer is important for a positive prognosis for 
treatment. Systemic treatment for metastatic prostate cancer is limited to hormone therapy 
and chemotherapy. Chemical or surgical castration has been the primary treatment for 
symptomatic metastatic prostate cancer for over 50 years. This testicular androgen 
deprivation therapy usually results in stabilization or regression of the disease (in 80% of 
patients), but progression of metastatic prostate cancer eventually develops (Panvichian et al., 
Cancer Control 3(6):493-500 (1996)). Metastatic disease is currently considered incurable, 
and the primary goals of treatment are to prolong survival and improve quality of life (Rago, 
Cancer Control 5(6):513-521 (1998)). 

Thus, methods that can be used for the diagnosis, prognosis, and effective 
treatment of prostate cancer, including in particular metastatic prostate 
cancer, would be desirable. Accordingly, provided herein are methods that can be used in 
diagnosis and prognosis of prostate cancer. Further provided are methods that can be used to 
screen candidate bioactive agents for the ability to modulate, e.g., treat, prostate cancer. 
Additionally, provided herein are molecular targets and compositions for therapeutic 
intervention in prostate cancer and other cancers. 

SUMMARY OF THE INVENTION 
The present invention therefore provides nucleotide sequences of genes that 
are up- and down-regulated in prostate cancer cells. Such genes are useful for diagnostic 
purposes, and also as targets for screening for therapeutic compounds that modulate prostate 
cancer, such as hormones or antibodies. Other aspects of the invention will become apparent 
to the skilled artisan by the following description of the invention. 

In one aspect, the present invention provides a method of detecting a prostate 
cancer-associated transcript in a cell from a patient, the method comprising contacting a 
biological sample from the patient with a polynucleotide that selectively hybridizes to a 
sequence at least 80% identical to a sequence as shown in Tables 1-16. 

In one embodiment, the present invention provides a method of determining 
the level of a prostate cancer associated transcript in a cell from a patient. 

In one embodiment, the present invention provides a method of detecting a 
prostate cancer-associated transcript in a cell from a patient, the method comprising 
contacting a biological sample from the patient with a polynucleotide that selectively 
hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1-16. 

In one embodiment, the polynucleotide selectively hybridizes to a sequence at 
least 95% identical to a sequence as shown in Tables 1-16. In another embodiment, the 
polynucleotide comprises a sequence as shown in Tables 1-16. 

In one embodiment, the biological sample is a tissue sample. In another 
embodiment, the biological sample comprises isolated nucleic acids, e.g., mRNA. 

In one embodiment, the polynucleotide is labeled, e.g., with a fluorescent 
label. 

In one embodiment, the polynucleotide is immobilized on a solid surface. 

In one embodiment, the patient is undergoing a therapeutic regimen to treat 
prostate cancer. In another embodiment, the patient is suspected of having metastatic 
prostate cancer. 

In one embodiment, the patient is a human. 

In one embodiment, the patient is suspected of having a taxol-resistant cancer. 

In one embodiment, the prostate cancer associated transcript is mRNA. 

In one embodiment, the method further comprises the step of amplifying 
nucleic acids before the step of contacting the biological sample with the polynucleotide. 

In another aspect, the present invention provides a method of monitoring the 
efficacy of a therapeutic treatment of prostate cancer, the method comprising the steps of: (i) 
providing a biological sample from a patient undergoing the therapeutic treatment; and (ii) 
determining the level of a prostate cancer-associated transcript in the biological sample by 
contacting the biological sample with a polynucleotide that selectively hybridizes to a 
sequence at least 80% identical to a sequence as shown in Tables 1-16, thereby monitoring 
the efficacy of the therapy. In a further embodiment, the patient has metastatic prostate 
cancer. In a further embodiment, the patient has a drug resistant (e.g., taxol resistant) form of 
prostate cancer. 

In one embodiment, the method further comprises the step of: (iii) comparing 
the level of the prostate cancer-associated transcript to a level of the prostate 
cancer-associated transcript in a biological sample from the patient prior to, or earlier in, 
the therapeutic treatment. 

Additionally, provided herein is a method of evaluating the effect of a 
candidate prostate cancer drug comprising administering the drug to a patient and removing a 
cell sample from the patient. The expression profile of the cell is then determined. This 
method may further comprise comparing the expression profile to an expression profile of a 
healthy individual. In a preferred embodiment, said expression profile includes a gene of 
Tables 1-16. 

In one aspect, the present invention provides an isolated nucleic acid molecule 
consisting of a polynucleotide sequence as shown in Tables 1-16. 

In one embodiment, an expression vector or cell comprises the isolated nucleic 
acid. 

In one aspect, the present invention provides an isolated polypeptide which is 
encoded by a nucleic acid molecule having a polynucleotide sequence as shown in Tables 1-16. 

In another aspect, the present invention provides an antibody that specifically 
binds to an isolated polypeptide which is encoded by a nucleic acid molecule having a 
polynucleotide sequence as shown in Tables 1-16. 

In one embodiment, the antibody is conjugated to an effector component, e.g., 
a fluorescent label, a radioisotope or a cytotoxic chemical. 

In one embodiment, the antibody is an antibody fragment. In another 
embodiment, the antibody is humanized. 

In one aspect, the present invention provides a method of detecting a prostate 
cancer cell in a biological sample from a patient, the method comprising contacting the 
biological sample with an antibody as described herein. 

In another aspect, the present invention provides a method of detecting 
antibodies specific to prostate cancer in a patient, the method comprising contacting a 
biological sample from the patient with a polypeptide encoded by a nucleic acid comprising a 
sequence from Tables 1-16. 

In another aspect, the present invention provides a method for identifying a 
compound that modulates a prostate cancer-associated polypeptide, the method comprising 
the steps of: (i) contacting the compound with a prostate cancer-associated polypeptide, the 
polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at least 
80% identical to a sequence as shown in Tables 1-16; and (ii) determining the functional 
effect of the compound upon the polypeptide. 

In one embodiment, the functional effect is a physical effect, an enzymatic 
effect, or a chemical effect. 

In one embodiment, the polypeptide is expressed in a eukaryotic host cell or 
cell membrane. In another embodiment, the polypeptide is recombinant. 

In one embodiment, the functional effect is determined by measuring ligand 
binding to the polypeptide. 

In another aspect, the present invention provides a method of inhibiting 
proliferation of a prostate cancer-associated cell to treat prostate cancer in a patient, the 
method comprising the step of administering to the patient a therapeutically effective amount 
of a compound identified as described herein. 

In one embodiment, the compound is an antibody. 

In another aspect, the present invention provides a drug screening assay 
comprising the steps of: (i) administering a test compound to a mammal having prostate 
cancer or to a cell sample isolated therefrom; (ii) comparing the level of gene expression of a 
polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence 
as shown in Tables 1-16 in a treated cell or mammal with the level of gene expression of the 
polynucleotide in a control cell sample or mammal, wherein a test compound that modulates 
the level of expression of the polynucleotide is a candidate for the treatment of prostate 
cancer. 

In one embodiment, the control is a mammal with prostate cancer or a cell 
sample therefrom that has not been treated with the test compound. In another embodiment, 
the control is a normal cell or mammal. 

In one embodiment, the test compound is administered in varying amounts or 
concentrations. In another embodiment, the test compound is administered for varying time 
periods. In another embodiment, the comparison can occur after addition or removal of the 
drug candidate. 

In one embodiment, the levels of a plurality of polynucleotides that selectively 
hybridize to a sequence at least 80% identical to a sequence as shown in Tables 1-16 are 
individually compared to their respective levels in a control cell sample or mammal. In a 
preferred embodiment, the plurality of polynucleotides is from three to ten. 
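
By way of illustration only, the individual comparison of a small panel of transcripts against a control might be sketched as follows. The gene labels, the pseudocount, and the two-fold cutoff are hypothetical choices made for this sketch; the specification does not prescribe any particular statistic.

```python
import math


def log2_fold_changes(treated, control, pseudocount=1.0):
    """Return log2(treated/control) for every transcript present in both samples.

    The pseudocount (an assumption of this sketch) avoids division by zero
    when a transcript is undetected in one sample.
    """
    shared = treated.keys() & control.keys()
    return {
        gene: math.log2((treated[gene] + pseudocount) / (control[gene] + pseudocount))
        for gene in sorted(shared)
    }


def modulated_transcripts(treated, control, min_abs_log2=1.0):
    """List transcripts whose level changed at least two-fold in either direction."""
    changes = log2_fold_changes(treated, control)
    return [gene for gene, lfc in changes.items() if abs(lfc) >= min_abs_log2]


# Hypothetical expression levels for a three-transcript panel.
treated_levels = {"geneA": 7.0, "geneB": 3.0, "geneC": 0.0}
control_levels = {"geneA": 1.0, "geneB": 3.0, "geneC": 3.0}
print(modulated_transcripts(treated_levels, control_levels))
```

A test compound that shifts one or more transcripts past the chosen cutoff would, under these assumptions, be flagged as a candidate for further study.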

In another aspect, the present invention provides a method for treating a 
mammal having prostate cancer comprising administering a compound identified by the 
assay described herein. 

In another aspect, the present invention provides a pharmaceutical 
composition for treating a mammal having prostate cancer, the composition comprising a 
compound identified by the assay described herein and a physiologically acceptable 
excipient. 

In one aspect, the present invention provides a method of screening drug 
candidates by providing a cell expressing a gene that is up- or down-regulated as in a 
prostate cancer. In one embodiment, a gene is selected from Tables 1-16. The method 
further includes adding a drug candidate to the cell and determining the effect of the drug 
candidate on the expression of the expression profile gene. 

In one embodiment, the method of screening drug candidates includes 
comparing the level of expression in the absence of the drug candidate to the level of 
expression in the presence of the drug candidate, wherein the concentration of the drug 
candidate can vary when present, and wherein the comparison can occur after addition or 
removal of the drug candidate. In a preferred embodiment, the cell expresses at least two 
expression profile genes. The profile genes may show an increase or decrease. 

Also provided is a method of evaluating the effect of a candidate prostate 
cancer drug comprising administering the drug to a transgenic animal expressing or 
over-expressing the prostate cancer modulatory protein, or an animal lacking the prostate 
cancer modulatory protein, for example as a result of a gene knockout. 

Moreover, provided herein is a biochip comprising one or more nucleic acid 
segments of Tables 1-16, wherein the biochip comprises fewer than 1000 nucleic acid probes. 
Preferably, at least two nucleic acid segments are included. More preferably, at least three 
nucleic acid segments are included. 

Furthermore, a method of diagnosing a disorder associated with prostate 
cancer is provided. The method comprises determining the expression of a gene of Tables 
1-16 in a first tissue type of a first individual, and comparing that expression to the expression 
of the gene from a second normal tissue type from the first individual or a second unaffected 
individual. A difference in the expression indicates that the first individual has a disorder 
associated with prostate cancer. 

In a further embodiment, the biochip also includes a polynucleotide sequence 
of a gene that is not up- or down-regulated in prostate cancer. 

In one embodiment, a method is provided for screening for a bioactive agent capable of 
interfering with the binding of a prostate cancer modulating protein (prostate cancer 
modulatory protein) or a fragment thereof and an antibody which binds to said prostate 
cancer modulatory protein or fragment thereof. In a preferred embodiment, the method 
comprises combining a prostate cancer modulatory protein or fragment thereof, a candidate 
bioactive agent and an antibody which binds to said prostate cancer modulatory protein or 
fragment thereof. The method further includes determining the binding of said prostate 
cancer modulatory protein or fragment thereof and said antibody. Where there is a change 
in binding, an agent is identified as an interfering agent. The interfering agent can be an 
agonist or an antagonist. Preferably, the agent inhibits prostate cancer. 

Also provided herein are methods of eliciting an immune response in an 
individual. In one embodiment a method provided herein comprises administering to an 
individual a composition comprising a prostate cancer modulating protein, or a fragment 
thereof. In another embodiment, the protein is encoded by a nucleic acid selected from those 
of Tables 1-16. 

Further provided herein are compositions capable of eliciting an immune 
response in an individual. In one embodiment, a composition provided herein comprises a 
prostate cancer modulating protein, preferably encoded by a nucleic acid of Tables 1-16, or a 
fragment thereof, and a pharmaceutically acceptable carrier. In another embodiment, said 
composition comprises a nucleic acid comprising a sequence encoding a prostate cancer 
modulating protein, preferably selected from the nucleic acids of Tables 1-16, and a 
pharmaceutically acceptable carrier. 

Also provided are methods of neutralizing the effect of a prostate cancer 
protein, or a fragment thereof, comprising contacting an agent specific for said protein with 
said protein in an amount sufficient to effect neutralization. In another embodiment, the 
protein is encoded by a nucleic acid selected from those of Tables 1-16. 

In another aspect of the invention, a method of treating an individual for 
prostate cancer is provided. In one embodiment, the method comprises administering to said 
individual an inhibitor of a prostate cancer modulating protein. In another embodiment, the 
method comprises administering to a patient having prostate cancer an antibody to a prostate 
cancer modulating protein conjugated to a therapeutic moiety. Such a therapeutic moiety can 
be a cytotoxic agent or a radioisotope. 

DETAILED DESCRIPTION OF THE INVENTION 
In accordance with the objects outlined above, the present invention provides 
novel methods for diagnosis and prognosis evaluation for prostate cancer (PC), including 
metastatic prostate cancer, as well as methods for screening for compositions which modulate 
prostate cancer. Also provided are methods for treating prostate cancer. 

In addition to the other nucleic acid and peptide sequences, the present 
invention also relates to the identification of PAA2 as a gene that is highly overexpressed in 
prostate cancer patient tissues. The PAA2 sequence is identical to that of the zinc transporter ZNT4. 
Results presented herein demonstrate that PAA2/ZNT4 is highly expressed in prostate cancer 
cells. The prostate gland is unique in that it has the highest capacity of any organ in the body 



WO 02/30268 



PCT/US01/32045 



to accumulate zinc. Zinc uptake is regulated by prolactin and testosterone, which induce the 
expression of a member of the ZIP family of zinc transporters (Costello et al., 1999, J. Biol. 
Chem. 274:17499-17504). Zinc accumulation in the prostate functions to inhibit citrate 
oxidation, which results in a decrease in cellular ATP production (Costello and Franklin, 
1998, Prostate 35:285-296). Cancer cells are more sensitive to decreased ATP production 
and have evolved to prevent zinc accumulation. Without wishing to be bound by theory, the 
up-regulation of ZNT4 in prostate cancer cells may result in protection of the cells from high 
zinc levels by its ability to pump accumulated zinc out of the cells. 

The present invention also relates to nucleic acid sequences encoding PBH1. 
PBH1 is related to human TRPC7 (transient receptor potential-related channels, NP_003298), 
a putative calcium channel highly expressed in brain (Nagamine et al., Genomics 54:124-131 
(1998)). Trp is related to melastatin, a gene down-regulated in metastatic melanomas 
(Duncan et al., Cancer Res. 58:1515-1520 (1998)), and MTR1, a gene localized to within the 
Beckwith-Wiedemann syndrome/Wilms' tumor susceptibility region (Prawitt et al., Hum. 
Mol. Genet. 9:203-216 (2000)). Without wishing to be bound by theory, it is believed that 
PBH1 functions as a calcium channel. 

As a calcium channel, PBH1 is an ideal target for a small molecule 
therapeutic, or a therapeutic antibody that disrupts channel function. CD20, the target of 
Rituximab in non-Hodgkin's lymphoma (Maloney et al., Blood 90:2188-2195 (1997); Leget 
and Czuczman, Curr. Opin. Oncol. 10:548-551 (1998)), is a plasma membrane calcium 
channel expressed in B cells (Tedder and Engel, Immunol. Today 15:450-454 (1994)). 
Similarly, a small molecule, or antibody that inhibits or alters a calcium signal mediated by 
PBH1, will result in the death of prostate cancer cells. 

PBH1, and other genes of the invention, are also useful as targets for 
cytotoxic T-lymphocytes. Genes that are tumor specific, or that are expressed in immune- 
privileged organs, are currently being used as potential vaccine targets (Van den Eynde and 
Boon, Int. J. Clin. Lab. Res. 27:81-86 (1997)). The expression pattern of PBH1 indicates that 
it is an ideal target for cytotoxic T-lymphocytes. Thus, therapies that utilize PBH1-specific 
cytotoxic T-lymphocytes to induce prostate cancer cell death are also provided by this 
invention. See, e.g., U.S. Patent No. 6,051,227 and WO 00/32231, the disclosures of which 
are herein incorporated by reference. 

The present invention is also related to the identification of PAA3 as a gene 
that is important in the modulation of prostate cancer and/or breast cancer. 

Tables 1-16 provide unigene cluster identification numbers, exemplar 
accession numbers, or genomic nucleotide position numbers for the nucleotide sequence of 
genes that exhibit increased or decreased expression in prostate cancer samples. 



Definitions 

The term "prostate cancer protein" or "prostate cancer polynucleotide" or 
"prostate cancer-associated transcript" refers to nucleic acid and polypeptide polymorphic 
variants, alleles, mutants, and interspecies homologues that: (1) have a nucleotide sequence 
that has greater than about 60% nucleotide sequence identity, 65%, 70%, 75%, 80%, 85%, 
90%, preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% or greater nucleotide 
sequence identity, preferably over a region of at least about 25, 50, 100, 200, 
500, 1000, or more nucleotides, to a nucleotide sequence of or associated with a unigene 
cluster of Tables 1-16; (2) bind to antibodies, e.g., polyclonal antibodies, raised against an 
immunogen comprising an amino acid sequence encoded by a nucleotide sequence of or 
associated with a unigene cluster of Tables 1-16, and conservatively modified variants 
thereof; (3) specifically hybridize under stringent hybridization conditions to a nucleic acid 
sequence, or the complement thereof, of Tables 1-16 and conservatively modified variants 
thereof; or (4) have an amino acid sequence that has greater than about 60% amino acid 
sequence identity, 65%, 70%, 75%, 80%, 85%, 90%, preferably 91%, 92%, 93%, 94%, 95%, 
96%, 97%, 98% or 99% or greater amino acid sequence identity, preferably over a region of 
at least about 25, 50, 100, 200, 500, 1000, or more amino acids, to an amino acid 
sequence encoded by a nucleotide sequence of or associated with a unigene cluster of Tables 
1-16. A polynucleotide or polypeptide sequence is typically from a mammal including, but 
not limited to, primate, e.g., human; rodent, e.g., rat, mouse, hamster; cow, pig, horse, sheep, 
or other mammal. A "prostate cancer polypeptide" and a "prostate cancer polynucleotide," 
include both naturally occurring or recombinant forms. 

A "full length" prostate cancer protein or nucleic acid refers to a prostate 
cancer polypeptide or polynucleotide sequence, or a variant thereof, that contains all of the 
elements normally contained in one or more naturally occurring, wild type prostate cancer 
polynucleotide or polypeptide sequences. For example, a full length prostate cancer nucleic 
acid will typically comprise all of the exons that encode for the full length, naturally occurring 
protein. The "full length" may be prior to, or after, various stages of post-translation 
processing or splicing, including alternative splicing. 

"Biological sample" as used herein is a sample of biological tissue or fluid that 
contains nucleic acids or polypeptides, e.g., of a prostate cancer protein, polynucleotide or 
transcript. Such samples include, but are not limited to, tissue isolated from primates, e.g., 
humans, or rodents, e.g., mice, and rats. Biological samples may also include sections of 
tissues such as biopsy and autopsy samples, frozen sections taken for histologic purposes, 
blood, plasma, serum, sputum, stool, tears, mucus, hair, skin, etc. Biological samples also 
include explants and primary and/or transformed cell cultures derived from patient tissues. A 
biological sample is typically obtained from a eukaryotic organism, most preferably a 
mammal such as a primate, e.g., chimpanzee or human; cow; dog; cat; a rodent, e.g., guinea 
pig, rat, mouse; rabbit; or a bird; reptile; or fish. 

"Providing a biological sample" means to obtain a biological sample for use in 
methods described in this invention. Most often, this will be done by removing a sample of 
cells from an animal, but can also be accomplished by using previously isolated cells (e.g., 
isolated by another person, at another time, and/or for another purpose), or by performing the 
methods of the invention in vivo. Archival tissues, having treatment or outcome history, will 
be particularly useful. 

The terms "identical" or percent "identity," in the context of two or more 
nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that 
are the same or have a specified percentage of amino acid residues or nucleotides that are the 
same (i.e., about 60% identity, preferably 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 
95%, 96%, 97%, 98%, 99%, or higher identity over a specified region, when compared and 
aligned for maximum correspondence over a comparison window or designated region) as 
measured using the BLAST or BLAST 2.0 sequence comparison algorithms with default 
parameters described below, or by manual alignment and visual inspection (see, e.g., NCBI 
web site http://www.ncbi.nlm.nih.gov/BLAST/ or the like). Such sequences are then said to 
be "substantially identical." This definition also refers to, or may be applied to, the 
complement of a test sequence. The definition also includes sequences that have deletions 
and/or additions, as well as those that have substitutions, as well as naturally occurring, e.g., 
polymorphic or allelic variants, and man-made variants. As described below, the preferred 
algorithms can account for gaps and the like. Preferably, identity exists over a region that is 
at least about 25 amino acids or nucleotides in length, or more preferably over a region that is 
50-100 amino acids or nucleotides in length. 
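
As a non-limiting illustration of the percent identity concept, the following minimal sketch computes identity over two sequences that have already been aligned. It assumes that gaps are written as "-" and that gapped columns count toward the length of the compared region; other conventions for the denominator exist, and the function names and the 80%/25-position defaults are chosen here only for illustration.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of compared alignment columns that hold identical residues.

    Columns where both sequences carry a gap are skipped; columns where only
    one sequence carries a gap count as compared but not identical
    (an assumption of this sketch).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    matches = 0
    compared = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap and b == gap:
            continue  # column carries no information
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


def is_substantially_identical(aligned_a, aligned_b, min_identity=80.0, min_region=25):
    """Check an identity threshold over a region of at least min_region columns."""
    if len(aligned_a) < min_region:
        return False
    return percent_identity(aligned_a, aligned_b) >= min_identity


print(percent_identity("ACGTACGT", "ACGTTCGT"))
print(percent_identity("ACG-T", "ACGGT"))
```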

For sequence comparison, typically one sequence acts as a reference sequence, 
to which test sequences are compared. When using a sequence comparison algorithm, test 
and reference sequences are entered into a computer, subsequence coordinates are designated, 
if necessary, and sequence algorithm program parameters are designated. Preferably, default 
program parameters can be used, or alternative parameters can be designated. The sequence 
comparison algorithm then calculates the percent sequence identities for the test sequences 
relative to the reference sequence, based on the program parameters. 

A "comparison window", as used herein, includes reference to a segment of 
any one of the number of contiguous positions selected from the group consisting typically of 
from 20 to 600, usually about 50 to about 200, more usually about 100 to about 150, in which 
a sequence may be compared to a reference sequence of the same number of contiguous 
positions after the two sequences are optimally aligned. Methods of alignment of sequences 
for comparison are well-known in the art. Optimal alignment of sequences for comparison 
can be conducted, e.g., by the local homology algorithm of Smith & Waterman, Adv. Appl. 
Math. 2:482 (1981), by the homology alignment algorithm of Needleman & Wunsch, J. Mol. 
Biol. 48:443 (1970), by the search for similarity method of Pearson & Lipman, Proc. Nat'l. 
Acad. Sci. USA 85:2444 (1988), by computerized implementations of these algorithms 
(GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, 
Genetics Computer Group, 575 Science Dr., Madison, WI), or by manual alignment and 
visual inspection (see, e.g., Current Protocols in Molecular Biology (Ausubel et al., eds. 
1995 supplement)). 
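
For illustration, a local alignment score in the manner of Smith & Waterman can be computed by dynamic programming as sketched below. The scoring values (match +2, mismatch -1, linear gap penalty -2) are assumptions made for this sketch only and are not parameters taken from the cited references or from the software packages named above.

```python
def smith_waterman_score(a: str, b: str, match: int = 2, mismatch: int = -1, gap: int = -2) -> int:
    """Return the best local alignment score between sequences a and b.

    Uses a linear gap penalty and keeps only two rows of the scoring
    matrix, so memory use is proportional to len(b).
    """
    cols = len(b) + 1
    prev = [0] * cols
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * cols
        for j in range(1, cols):
            diagonal = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A local alignment may restart anywhere, so scores never drop below zero.
            curr[j] = max(0, diagonal, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


print(smith_waterman_score("TTTACGTTT", "GGGACGGGG"))
```

In this example the only shared segment is ACG, so the best local score is three matches at +2 each.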

Preferred examples of algorithms that are suitable for determining percent 
sequence identity and sequence similarity include the BLAST and BLAST 2.0 algorithms, 
which are described in Altschul et al., Nuc. Acids Res. 25:3389-3402 (1997) and Altschul et 
al., J. Mol. Biol. 215:403-410 (1990). BLAST and BLAST 2.0 are used, with the parameters 
described herein, to determine percent sequence identity for the nucleic acids and proteins of 
the invention. Software for performing BLAST analyses is publicly available through the 
National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This 
algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short 
words of length W in the query sequence, which either match or satisfy some positive-valued 
threshold score T when aligned with a word of the same length in a database sequence. T is 
referred to as the neighborhood word score threshold (Altschul et al., supra). These initial 
neighborhood word hits act as seeds for initiating searches to find longer HSPs containing 
them. The word hits are extended in both directions along each sequence for as far as the 
cumulative alignment score can be increased. Cumulative scores are calculated using, e.g., 
for nucleotide sequences, the parameters M (reward score for a pair of matching residues; 
always > 0) and N (penalty score for mismatching residues; always < 0). For amino acid 
sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word 
hits in each direction is halted when: the cumulative alignment score falls off by the 
quantity X from its maximum achieved value; the cumulative score goes to zero or below, 
due to the accumulation of one or more negative-scoring residue alignments; or the end of 
either sequence is reached. The BLAST algorithm parameters W, T, and X determine the 
sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) 
uses as defaults a wordlength (W) of 11, an expectation (E) of 10, M=5, N=-4 and a 
comparison of both strands. For amino acid sequences, the BLASTP program uses as 
defaults a wordlength of 3, an expectation (E) of 10, and the BLOSUM62 scoring matrix 
(see Henikoff & Henikoff, Proc. Natl. Acad. Sci. USA 89:10915 (1992)) alignments (B) of 
50, expectation (E) of 10, M=5, N=-4, and a comparison of both strands. 
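
The seed-and-extend procedure described above can be sketched, in greatly simplified form, as follows. This sketch handles nucleotide sequences only, uses exact word matches of length W as seeds (the neighborhood threshold T is not modeled), extends without gaps, and uses a hypothetical drop-off value X of 20; it is an illustration of the stopping rules and not the BLAST implementation itself.

```python
from collections import defaultdict


def find_seeds(query: str, subject: str, w: int = 11):
    """Return (query_pos, subject_pos) pairs where a word of length w matches exactly."""
    index = defaultdict(list)
    for j in range(len(subject) - w + 1):
        index[subject[j:j + w]].append(j)
    return [
        (i, j)
        for i in range(len(query) - w + 1)
        for j in index.get(query[i:i + w], [])
    ]


def extend(query, subject, qi, sj, step, start, m, n, x):
    """Extend without gaps in one direction and return the best cumulative score."""
    total = best = start
    while 0 <= qi < len(query) and 0 <= sj < len(subject):
        total += m if query[qi] == subject[sj] else n
        if total > best:
            best = total
        # Halt when the score falls X below its maximum or reaches zero or below.
        if best - total >= x or total <= 0:
            break
        qi += step
        sj += step
    return best  # the loop also ends when either sequence is exhausted


def hsp_score(query, subject, qi, sj, w=11, m=5, n=-4, x=20):
    """Score the high scoring pair grown from the seed at (qi, sj)."""
    score = w * m  # every position inside the seed word is a match
    score = extend(query, subject, qi + w, sj + w, +1, score, m, n, x)
    score = extend(query, subject, qi - 1, sj - 1, -1, score, m, n, x)
    return score


query = "ACGTACGTACGTAAA"
subject = "GGACGTACGTACGTCCC"
for qi, sj in find_seeds(query, subject):
    print(qi, sj, hsp_score(query, subject, qi, sj))
```

Both seeds in this example grow into the same twelve-base match, so each reports the same score of 12 x M = 60.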

The BLAST algorithm also performs a statistical analysis of the similarity 
between two sequences (see, e.g., Karlin & Altschul, Proc. Nat'l. Acad. Sci. USA 90:5873-5877 
(1993)). One measure of similarity provided by the BLAST algorithm is the smallest 
sum probability (P(N)), which provides an indication of the probability by which a match 
between two nucleotide or amino acid sequences would occur by chance. For example, a 
nucleic acid is considered similar to a reference sequence if the smallest sum probability in a 
comparison of the test nucleic acid to the reference nucleic acid is less than about 0.2, more
preferably less than about 0.01, and most preferably less than about 0.001. Log values may
be large negative numbers, e.g., 5, 10, 20, 30, 40, 40, 70, 90, 110, 150, 170, etc.

An indication that two nucleic acid sequences or polypeptides are substantially
identical is that the polypeptide encoded by the first nucleic acid is immunologically cross 
reactive with the antibodies raised against the polypeptide encoded by the second nucleic 
acid, as described below. Thus, a polypeptide is typically substantially identical to a second 
polypeptide, e.g., where the two peptides differ only by conservative substitutions. Another
indication that two nucleic acid sequences are substantially identical is that the two molecules 
or their complements hybridize to each other under stringent conditions, as described below. 
Yet another indication that two nucleic acid sequences are substantially identical is that the 
same primers can be used to amplify the sequences. 

A "host cell" is a naturally occurring cell or a transformed cell that contains an
expression vector and supports the replication or expression of the expression vector. Host 
cells may be cultured cells, explants, cells in vivo, and the like. Host cells may be 
prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, insect, amphibian, or 
mammalian cells such as CHO, HeLa, and the like (see, e.g., the American Type Culture
Collection catalog or web site, www.atcc.org).

The terms "isolated," "purified," or "biologically pure" refer to material that is 
substantially or essentially free from components that normally accompany it as found in its 
native state. Purity and homogeneity are typically determined using analytical chemistry 
techniques such as polyacrylamide gel electrophoresis or high performance liquid
chromatography. A protein or nucleic acid that is the predominant species present in a
preparation is substantially purified. In particular, an isolated nucleic acid is separated from 
some open reading frames that naturally flank the gene and encode proteins other than the protein
encoded by the gene. The term "purified" in some embodiments denotes that a nucleic acid
or protein gives rise to essentially one band in an electrophoretic gel. Preferably, it means
that the nucleic acid or protein is at least 85% pure, more preferably at least 95% pure, and
most preferably at least 99% pure. "Purify" or "purification" in other embodiments means 
removing at least one contaminant from the composition to be purified. In this sense, 
purification does not require that the purified compound be homogenous, e.g., 100% pure. 

The terms "polypeptide," "peptide" and "protein" are used interchangeably
herein to refer to a polymer of amino acid residues. The terms apply to amino acid polymers
in which one or more amino acid residue is an artificial chemical mimetic of a corresponding
naturally occurring amino acid, as well as to naturally occurring amino acid polymers, those
containing modified residues, and non-naturally occurring amino acid polymers.

The term "amino acid" refers to naturally occurring and synthetic amino acids, 
as well as amino acid analogs and amino acid mimetics that function similarly to the naturally 
occurring amino acids. Naturally occurring amino acids are those encoded by the genetic
code, as well as those amino acids that are later modified, e.g., hydroxyproline,
γ-carboxyglutamate, and O-phosphoserine. Amino acid analogs refers to compounds that have
the same basic chemical structure as a naturally occurring amino acid, e.g., an α carbon that is
bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine,
norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs may have
modified R groups (e.g., norleucine) or modified peptide backbones, but retain the same basic
chemical structure as a naturally occurring amino acid. Amino acid mimetics refers to 
chemical compounds that have a structure that is different from the general chemical 
structure of an amino acid, but that function similarly to a naturally occurring amino acid.

Amino acids may be referred to herein by either their commonly known three
letter symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical 
Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly 
accepted single-letter codes. 

"Conservatively modified variants" applies to both amino acid and nucleic 

acid sequences. With respect to particular nucleic acid sequences, conservatively modified
variants refers to those nucleic acids which encode identical or essentially identical amino 
acid sequences, or where the nucleic acid does not encode an amino acid sequence, to 
essentially identical or associated, e.g., naturally contiguous, sequences. Because of the 
degeneracy of the genetic code, a large number of functionally identical nucleic acids encode
most proteins. For instance, the codons GCA, GCC, GCG and GCU all encode the amino
acid alanine. Thus, at every position where an alanine is specified by a codon, the codon can 
be altered to another of the corresponding codons described without altering the encoded 
polypeptide. Such nucleic acid variations are "silent variations," which are one species of 
conservatively modified variations. Every nucleic acid sequence herein which encodes a
polypeptide also describes silent variations of the nucleic acid. One of skill will recognize
that in certain contexts each codon in a nucleic acid (except AUG, which is ordinarily the
only codon for methionine, and TGG, which is ordinarily the only codon for tryptophan) can
be modified to yield a functionally identical molecule. Accordingly, silent variations of
a nucleic acid which encodes a polypeptide are often implicit in a described sequence with respect to
the expression product, but not with respect to actual probe sequences.
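
The notion of a silent variation can be made concrete with a short Python sketch. It assumes the standard genetic code written with RNA bases (U rather than T) and in-frame, upper-case coding sequences; the names CODON_TABLE, synonymous_codons and is_silent_variation are illustrative and do not come from the specification.

```python
from itertools import product

# Standard genetic code, with codons enumerated in U, C, A, G order
# (first base varies slowest, third base fastest). '*' marks a stop codon.
BASES = "UCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def synonymous_codons(codon):
    """Return every codon that encodes the same amino acid as `codon`."""
    aa = CODON_TABLE[codon]
    return sorted(c for c, a in CODON_TABLE.items() if a == aa)


def translate(seq):
    """Translate an in-frame RNA coding sequence into one-letter amino acids."""
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


def is_silent_variation(seq_a, seq_b):
    """True if the sequences differ but encode the same polypeptide."""
    if len(seq_a) != len(seq_b) or len(seq_a) % 3 != 0:
        return False
    return seq_a != seq_b and translate(seq_a) == translate(seq_b)


if __name__ == "__main__":
    print(synonymous_codons("GCA"))   # the alanine codons
    print(synonymous_codons("AUG"))   # methionine has a single codon
    print(is_silent_variation("AUGGCAUGG", "AUGGCCUGG"))
```

In this table AUG and UGG are each the only codon for their amino acid, so any change to them alters the encoded polypeptide.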

As to amino acid sequences, one of skill will recognize that individual
substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein 
sequence which alters, adds or deletes a single amino acid or a small percentage of amino 
acids in the encoded sequence is a "conservatively modified variant" where the alteration 
results in the substitution of an amino acid with a chemically similar amino acid. 

Conservative substitution tables providing functionally similar amino acids are well known in
the art. Such conservatively modified variants are in addition to and do not exclude
polymorphic variants, interspecies homologs, and alleles of the invention.

The following eight groups each contain amino acids that are typically
conservative substitutions for one another: 1) Alanine (A), Glycine (G); 2) Aspartic acid (D),
Glutamic acid (E); 3) Asparagine (N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5)
Isoleucine (I), Leucine (L), Methionine (M), Valine (V); 6) Phenylalanine (F), Tyrosine (Y),
Tryptophan (W); 7) Serine (S), Threonine (T); and 8) Cysteine (C), Methionine (M) (see,
e.g., Creighton, Proteins (1984)).
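
As an illustration only, the eight groups listed above can be encoded as sets and used to test whether two peptides of equal length differ only by conservative substitutions. The names below are hypothetical, and the equal-length, position-by-position comparison is a simplifying assumption: the sketch does not handle insertions or deletions.

```python
# The eight substitution groups listed in the text, as one-letter codes.
CONSERVATIVE_GROUPS = [
    set("AG"),    # 1) Alanine, Glycine
    set("DE"),    # 2) Aspartic acid, Glutamic acid
    set("NQ"),    # 3) Asparagine, Glutamine
    set("RK"),    # 4) Arginine, Lysine
    set("ILMV"),  # 5) Isoleucine, Leucine, Methionine, Valine
    set("FYW"),   # 6) Phenylalanine, Tyrosine, Tryptophan
    set("ST"),    # 7) Serine, Threonine
    set("CM"),    # 8) Cysteine, Methionine
]


def is_conservative(a, b):
    """True if residues a and b are identical or share at least one group."""
    return a == b or any(a in g and b in g for g in CONSERVATIVE_GROUPS)


def differ_only_conservatively(pep1, pep2):
    """True if equal-length peptides differ only by conservative substitutions."""
    if len(pep1) != len(pep2):
        return False
    return all(is_conservative(a, b) for a, b in zip(pep1, pep2))


if __name__ == "__main__":
    print(differ_only_conservatively("MKDL", "MREI"))  # K/R, D/E, L/I
    print(is_conservative("C", "L"))                   # no shared group
```

Because methionine appears in both group 5 and group 8, the relation is not transitive: M pairs with C and with L, but C and L share no group.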

Macromolecular structures such as polypeptide structures can be described in 
terms of various levels of organization. For a general discussion of this organization, see,
e.g., Alberts et al., Molecular Biology of the Cell (3rd ed., 1994) and Cantor & Schimmel,
Biophysical Chemistry Part I: The Conformation of Biological Macromolecules (1980).
"Primary structure" refers to the amino acid sequence of a particular peptide. "Secondary 
structure" refers to locally ordered, three dimensional structures within a polypeptide. These 
structures are commonly known as domains. Domains are portions of a polypeptide that
often form a compact unit of the polypeptide and are typically 25 to approximately 500
amino acids long. Typical domains are made up of sections of lesser organization such as
stretches of β-sheet and α-helices. "Tertiary structure" refers to the complete three
dimensional structure of a polypeptide monomer. "Quaternary structure" refers to the three
dimensional structure formed, usually by the noncovalent association of independent tertiary
units. Anisotropic terms are also known as energy terms.

"Nucleic acid" or "oligonucleotide" or "polynucleotide" or grammatical
equivalents used herein means at least two nucleotides covalently linked together.
Oligonucleotides are typically from about 5, 6, 7, 8, 9, 10, 12, 15, 25, 30, 40, 50 or more
nucleotides in length, up to about 100 nucleotides in length. Nucleic acids and
polynucleotides are polymers of any length, including longer lengths, e.g., 200, 300, 500,
1000, 2000, 3000, 5000, 7000, 10,000, etc. A nucleic acid of the present invention will 
generally contain phosphodiester bonds, although in some cases, nucleic acid analogs are 
included that may have alternate backbones, comprising, e.g., phosphoramidate, 
phosphorothioate, phosphorodithioate, or O-methylphosphoroamidite linkages (see Eckstein,
Oligonucleotides and Analogues: A Practical Approach, Oxford University Press); and
peptide nucleic acid backbones and linkages. Other analog nucleic acids include those with
positive backbones; non-ionic backbones, and non-ribose backbones, including those
described in U.S. Patent Nos. 5,235,033 and 5,034,506, and Chapters 6 and 7, ASC
Symposium Series 580, Carbohydrate Modifications in Antisense Research, Sanghui &
Cook, eds. Nucleic acids containing one or more carbocyclic sugars are also included within
one definition of nucleic acids. Modifications of the ribose-phosphate backbone may be done 
for a variety of reasons, e.g. to increase the stability and half-life of such molecules in 
physiological environments or as probes on a biochip. Mixtures of naturally occurring 
nucleic acids and analogs can be made; alternatively, mixtures of different nucleic acid
analogs, and mixtures of naturally occurring nucleic acids and analogs may be made.

A variety of references disclose such nucleic acid analogs, including, for 
example, phosphoramidate (Beaucage et al., Tetrahedron 49(10):1925 (1993) and references 
therein; Letsinger, J. Org. Chem. 35:3800 (1970); Sprinzl et al., Eur. J. Biochem. 81:579 
(1977); Letsinger et al., Nucl. Acids Res. 14:3487 (1986); Sawai et al., Chem. Lett. 805
(1984); Letsinger et al., J. Am. Chem. Soc. 110:4470 (1988); and Pauwels et al., Chemica
Scripta 26:141 (1986)), phosphorothioate (Mag et al., Nucleic Acids Res. 19:1437 (1991);
and U.S. Patent No. 5,644,048), phosphorodithioate (Briu et al., J. Am. Chem. Soc. 111:2321
(1989)), O-methylphosphoroamidite linkages (see Eckstein, Oligonucleotides and Analogues:
A Practical Approach, Oxford University Press), and peptide nucleic acid backbones and
linkages (see Egholm, J. Am. Chem. Soc. 114:1895 (1992); Meier et al., Chem. Int. Ed. Engl.
31:1008 (1992); Nielsen, Nature, 365:566 (1993); Carlsson et al., Nature 380:207 (1996), all
of which are incorporated by reference). Other analog nucleic acids include those with
positive backbones (Denpcy et al., Proc. Natl. Acad. Sci. USA 92:6097 (1995)); non-ionic
backbones (U.S. Patent Nos. 5,386,023, 5,637,684, 5,602,240, 5,216,141 and 4,469,863;
Kiedrowshi et al., Angew. Chem. Intl. Ed. English 30:423 (1991); Letsinger et al., J. Am.
Chem. Soc. 110:4470 (1988); Letsinger et al., Nucleoside & Nucleotide 13:1597 (1994);
Chapters 2 and 3, ASC Symposium Series 580, "Carbohydrate Modifications in Antisense 
Research", Ed. Y.S. Sanghui and P. Dan Cook; Mesmaeker et al., Bioorganic & Medicinal 
Chem. Lett. 4:395 (1994); Jeffs et al., J. Biomolecular NMR 34:17 (1994); Tetrahedron Lett. 
37:743 (1996)) and non-ribose backbones, including those described in U.S. Patent Nos.
5,235,033 and 5,034,506, and Chapters 6 and 7, ASC Symposium Series 580, "Carbohydrate
Modifications in Antisense Research", Ed. Y.S. Sanghui and P. Dan Cook. Nucleic acids 
containing one or more carbocyclic sugars are also included within one definition of nucleic 
acids (see Jenkins et al., Chem. Soc. Rev. (1995) pp 169-176). Several nucleic acid analogs 
are described in Rawls, C & E News June 2, 1997 page 35. All of these references are hereby
expressly incorporated by reference.

Particularly preferred are peptide nucleic acids (PNA), which include peptide
nucleic acid analogs. These backbones are substantially non-ionic under neutral conditions, in
contrast to the highly charged phosphodiester backbone of naturally occurring nucleic acids.
This results in two advantages. First, the PNA backbone exhibits improved hybridization
kinetics. PNAs have larger changes in the melting temperature (Tm) for mismatched versus
perfectly matched basepairs. DNA and RNA typically exhibit a 2-4°C drop in Tm for an
internal mismatch. With the non-ionic PNA backbone, the drop is closer to 7-9°C. Similarly,
due to their non-ionic nature, hybridization of the bases attached to these backbones is
relatively insensitive to salt concentration. In addition, PNAs are not degraded by cellular
enzymes, and thus can be more stable.

The nucleic acids may be single stranded or double stranded, as specified, or 
contain portions of both double stranded and single stranded sequence. As will be appreciated
by those in the art, the depiction of a single strand also defines the sequence of the
complementary strand; thus the sequences described herein also provide the complement of
the sequence. The nucleic acid may be DNA, both genomic and cDNA, RNA or a hybrid,
where the nucleic acid may contain combinations of deoxyribo- and ribo-nucleotides, and 



WO 02/30268 



PCT/US01/32045 



combinations of bases, including uracil, adenine, thymine, cytosine, guanine, inosine, 
xanthine, hypoxanthine, isocytosine, isoguanine, etc. "Transcript" typically refers to a
naturally occurring RNA, e.g., a pre-mRNA, hnRNA, or mRNA. As used herein, the term
"nucleoside" includes nucleotides and nucleoside and nucleotide analogs, and modified
nucleosides such as amino modified nucleosides. In addition, "nucleoside" includes
non-naturally occurring analog structures. Thus, e.g., the individual units of a peptide nucleic
acid, each containing a base, are referred to herein as a nucleoside. 
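
The statement that the depiction of a single strand also defines its complementary strand can be shown with a brief Python sketch. It assumes upper-case sequences containing only the four standard bases (A, C, G and either T or U) and reports the complement in the conventional 5' to 3' orientation; the function name complement_strand is hypothetical, and the modified bases mentioned above are not handled.

```python
def complement_strand(seq):
    """Return the complementary strand of `seq`, written 5' to 3'.

    Assumes an upper-case sequence of A, C, G and T (DNA) or U (RNA).
    """
    if "U" in seq:
        table = str.maketrans("ACGU", "UGCA")  # RNA pairing
    else:
        table = str.maketrans("ACGT", "TGCA")  # DNA pairing
    # Complement each base, then reverse so the result also reads 5' to 3'.
    return seq.translate(table)[::-1]


if __name__ == "__main__":
    print(complement_strand("ATGC"))
    print(complement_strand("AUGC"))
```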

A "label" or a "detectable moiety" is a composition detectable by 
spectroscopic, photochemical, biochemical, immunochemical, chemical, or other physical
means. For example, useful labels include fluorescent dyes, electron-dense reagents, enzymes
(e.g., as commonly used in an ELISA), biotin, digoxigenin, or haptens and proteins or other 
entities which can be made detectable, e.g., by incorporating a radiolabel into the peptide or 
used to detect antibodies specifically reactive with the peptide. The radioisotope may be, for 
example, 3H, 14C, 32P, 35S, or 125I. In some cases, particularly using antibodies against the
proteins of the invention, the radioisotopes are used as toxic moieties, as described below.
The labels may be incorporated into the prostate cancer nucleic acids, proteins and antibodies
at any position. Any method known in the art for conjugating the antibody to the label may
be employed, including those methods described by Hunter et al., Nature, 144:945 (1962);
David et al., Biochemistry, 13:1014 (1974); Pain et al., J. Immunol. Meth., 40:219 (1981);
and Nygren, J. Histochem. and Cytochem., 30:407 (1982). The lifetime of radiolabeled
peptides or radiolabeled antibody compositions may be extended by the addition of substances
that stabilize the radiolabeled peptide or antibody and protect it from degradation. Any
substance or combination of substances that stabilize the radiolabeled peptide or antibody may
be used including those substances disclosed in US Patent No. 5,961,955.

An "effector" or "effector moiety" or "effector component" is a molecule that
is bound (or linked, or conjugated), either covalently, through a linker or a chemical bond, or 
noncovalently, through ionic, van der Waals, electrostatic, or hydrogen bonds, to an antibody. 
The "effector" can be a variety of molecules including, e.g., detection moieties including 
radioactive compounds, fluorescent compounds, an enzyme or substrate, tags such as epitope
tags, a toxin; activatable moieties, a chemotherapeutic agent; a lipase; an antibiotic; or a
radioisotope emitting "hard" e.g., beta radiation.

A "labeled nucleic acid probe or oligonucleotide" is one that is bound, either
covalently, through a linker or a chemical bond, or noncovalently, through ionic, van der 
Waals, electrostatic, or hydrogen bonds to a label such that the presence of the probe may be 
detected by detecting the presence of the label bound to the probe. Alternatively, methods
using high affinity interactions may achieve the same results where one of a pair of binding
partners binds to the other, e.g., biotin, streptavidin.

As used herein a "nucleic acid probe or oligonucleotide" is defined as a 
nucleic acid capable of binding to a target nucleic acid of complementary sequence through 
one or more types of chemical bonds, usually through complementary base pairing, usually
through hydrogen bond formation. As used herein, a probe may include natural (i.e., A, G, C,
or T) or modified bases (7-deazaguanosine, inosine, etc.). In addition, the bases in a probe 
may be joined by a linkage other than a phosphodiester bond, so long as it does not 
functionally interfere with hybridization. Thus, e.g., probes may be peptide nucleic acids in 
which the constituent bases are joined by peptide bonds rather than phosphodiester linkages.
It will be understood by one of skill in the art that probes may bind target sequences lacking
complete complementarity with the probe sequence depending upon the stringency of the 
hybridization conditions. The probes are preferably directly labeled as with isotopes, 
chromophores, lumiphores, chromogens, or indirectly labeled such as with biotin to which a 
streptavidin complex may later bind. By assaying for the presence or absence of the probe,
one can detect the presence or absence of the select sequence or subsequence. Diagnosis or
prognosis may be based at the genomic level, or at the level of RNA or protein expression. 

The term "recombinant" when used with reference, e.g., to a cell, or nucleic 
acid, protein, or vector, indicates that the cell, nucleic acid, protein or vector, has been 
modified by the introduction of a heterologous nucleic acid or protein or the alteration of a
native nucleic acid or protein, or that the cell is derived from a cell so modified. Thus, e.g.,
recombinant cells express genes that are not found within the native (non-recombinant) form 
of the cell or express native genes that are otherwise abnormally expressed, under expressed 
or not expressed at all. By the term "recombinant nucleic acid" herein is meant nucleic acid, 
originally formed in vitro, in general, by the manipulation of nucleic acid, e.g., using
polymerases and endonucleases, in a form not normally found in nature. In this manner,
operable linkage of different sequences is achieved. Thus an isolated nucleic acid, in a linear
form, or an expression vector formed in vitro by ligating DNA molecules that are not
normally joined, are both considered recombinant for the purposes of this invention. It is
understood that once a recombinant nucleic acid is made and reintroduced into a host cell or
organism, it will replicate non-recombinantly, i.e., using the in vivo cellular machinery of the
host cell rather than in vitro manipulations; however, such nucleic acids, once produced
recombinantly, although subsequently replicated non-recombinantly, are still considered 
recombinant for the purposes of the invention. Similarly, a "recombinant protein" is a protein 
made using recombinant techniques, i.e., through the expression of a recombinant nucleic 
acid as depicted above. 

The term "heterologous" when used with reference to portions of a nucleic
acid indicates that the nucleic acid comprises two or more subsequences that are not normally 
found in the same relationship to each other in nature. For instance, the nucleic acid is 
typically recombinantly produced, having two or more sequences, e.g., from unrelated genes 
arranged to make a new functional nucleic acid, e.g., a promoter from one source and a
coding region from another source. Similarly, a heterologous protein will often refer to two
or more subsequences that are not found in the same relationship to each other in nature (e.g., 
a fusion protein). 

A "promoter" is defined as an array of nucleic acid control sequences that 
direct transcription of a nucleic acid. As used herein, a promoter includes necessary nucleic
acid sequences near the start site of transcription, such as, in the case of a polymerase II type
promoter, a TATA element. A promoter also optionally includes distal enhancer or repressor 
elements, which can be located as much as several thousand base pairs from the start site of 
transcription. A "constitutive" promoter is a promoter that is active under most 
environmental and developmental conditions. An "inducible" promoter is a promoter that is
active under environmental or developmental regulation. The term "operably linked" refers
to a functional linkage between a nucleic acid expression control sequence (such as a 
promoter, or array of transcription factor binding sites) and a second nucleic acid sequence, 
wherein the expression control sequence directs transcription of the nucleic acid 
corresponding to the second sequence. 

An "expression vector" is a nucleic acid construct, generated recombinantly or
synthetically, with a series of specified nucleic acid elements that permit transcription of a
particular nucleic acid in a host cell. The expression vector can be part of a plasmid, virus, or
nucleic acid fragment. Typically, the expression vector includes a nucleic acid to be 
transcribed operably linked to a promoter. 

The phrase "selectively (or specifically) hybridizes to" refers to the binding, 
duplexing, or hybridizing of a molecule only to a particular nucleotide sequence that is
determinative of the presence of the nucleotide sequence, in a heterogeneous population of
nucleic acids and other biologics (e.g., total cellular or library DNA or RNA). Similarly, the
phrase "specifically (or selectively) binds" to an antibody or "specifically (or selectively)
immunoreactive with," when referring to a protein or peptide, refers to a binding reaction that
is determinative of the presence of the protein, in a heterogeneous population of proteins and
other biologics. Thus, under designated immunoassay or nucleic acid hybridization
conditions, the specified antibodies or nucleic acid probes bind to a particular protein or
nucleotide sequence at least two times the background and more typically more than 10 to
100 times background. 

Specific binding to an antibody under such conditions requires an antibody
that is selected for its specificity for a particular protein. For example, polyclonal antibodies 
raised to a particular protein, polymorphic variants, alleles, orthologs, and conservatively 
modified variants, or splice variants, or portions thereof, can be selected to obtain only those 
polyclonal antibodies that are specifically immunoreactive with the desired prostate cancer
protein and not with other proteins. This selection may be achieved by subtracting out
antibodies that cross-react with other molecules. A variety of immunoassay formats may be
used to select antibodies specifically immunoreactive with a particular protein. For example, 
solid-phase ELISA immunoassays are routinely used to select antibodies specifically 
immunoreactive with a protein (see, e.g., Harlow & Lane, Antibodies, A Laboratory Manual
(1988) for a description of immunoassay formats and conditions that can be used to
determine specific immunoreactivity). 

The phrase "stringent hybridization conditions" refers to conditions under 
which a probe will hybridize to its target subsequence, typically in a complex mixture of 
nucleic acids, but to no other sequences. Stringent conditions are sequence-dependent and
will be different in different circumstances. Longer sequences hybridize specifically at
higher temperatures. An extensive guide to the hybridization of nucleic acids is found in
Tijssen, Techniques in Biochemistry and Molecular Biology-Hybridization with Nucleic
Probes, "Overview of principles of hybridization and the strategy of nucleic acid assays"
(1993). Generally, stringent conditions are selected to be about 5-10°C lower than the
thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is
the temperature (under defined ionic strength, pH, and nucleic acid concentration) at which 50%
of the probes complementary to the target hybridize to the target sequence at equilibrium (as
the target sequences are present in excess, at Tm, 50% of the probes are occupied at
equilibrium). Stringent conditions will be those in which the salt concentration is less than
about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium ion concentration (or other
salts) at pH 7.0 to 8.3 and the temperature is at least about 30°C for short probes (e.g., 10 to
50 nucleotides) and at least about 60°C for long probes (e.g., greater than 50 nucleotides).
Stringent conditions may also be achieved with the addition of destabilizing agents such as
formamide. For selective or specific hybridization, a positive signal is at least two times
background, preferably 10 times background hybridization. Exemplary stringent
hybridization conditions can be as follows: 50% formamide, 5x SSC, and 1% SDS,
incubating at 42°C, or, 5x SSC, 1% SDS, incubating at 65°C, with wash in 0.2x SSC, and
0.1% SDS at 65°C. For PCR, a temperature of about 36°C is typical for low stringency
amplification, although annealing temperatures may vary between about 32°C and 48°C
depending on primer length. For high stringency PCR amplification, a temperature of about
62°C is typical, although high stringency annealing temperatures can range from about 50°C
to about 65°C, depending on the primer length and specificity. Typical cycle conditions for
both high and low stringency amplifications include a denaturation phase of 90°C - 95°C for
30 sec - 2 min., an annealing phase lasting 30 sec. - 2 min., and an extension phase of about
72°C for 1 - 2 min. Protocols and guidelines for low and high stringency amplification
reactions are provided, e.g., in Innis et al. (1990) PCR Protocols, A Guide to Methods and
Applications, Academic Press, Inc. N.Y.
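
By way of illustration only, the numerical guidelines above for stringent hybridization can be collected into a small Python check. The text qualifies every value with "about", so the hard cut-offs below are a simplifying assumption, and the function names are hypothetical.

```python
def min_stringent_temperature_c(probe_length_nt):
    """Approximate minimum temperature for stringent hybridization.

    Short probes (e.g., 10 to 50 nucleotides): at least about 30 C.
    Long probes (greater than 50 nucleotides): at least about 60 C.
    """
    return 30.0 if probe_length_nt <= 50 else 60.0


def is_stringent(sodium_molar, ph, temperature_c, probe_length_nt):
    """Check conditions against the salt, pH and temperature guidelines."""
    return (
        sodium_molar < 1.0                 # less than about 1.0 M sodium ion
        and 7.0 <= ph <= 8.3               # pH 7.0 to 8.3
        and temperature_c >= min_stringent_temperature_c(probe_length_nt)
    )


def is_positive_signal(signal, background, fold=2.0):
    """A positive signal is at least `fold` times background (2x by default)."""
    return signal >= fold * background


if __name__ == "__main__":
    print(is_stringent(0.5, 7.5, 42.0, 25))    # short probe at 42 C
    print(is_stringent(0.5, 7.5, 42.0, 200))   # long probe at 42 C
    print(is_positive_signal(25.0, 10.0))
```

The sketch does not model the effect of destabilizing agents such as formamide, which the text notes can also be used to achieve stringent conditions at lower temperatures.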

Nucleic acids that do not hybridize to each other under stringent conditions are 
still substantially identical if the polypeptides which they encode are substantially identical. 
This occurs, e.g., when a copy of a nucleic acid is created using the maximum codon
degeneracy permitted by the genetic code. In such cases, the nucleic acids typically hybridize
under moderately stringent hybridization conditions. Exemplary "moderately stringent
hybridization conditions" include a hybridization in a buffer of 40% formamide, 1 M NaCl,
1% SDS at 37°C, and a wash in 1X SSC at 45°C. A positive hybridization is at least twice
background. Those of ordinary skill will readily recognize that alternative hybridization and
wash conditions can be utilized to provide conditions of similar stringency. Additional
guidelines for determining hybridization parameters are provided in numerous references, e.g.,
Current Protocols in Molecular Biology, ed. Ausubel, et al.

The phrase "functional effects" in the context of assays for testing compounds 
that modulate activity of a prostate cancer protein includes the determination of a parameter
that is indirectly or directly under the influence of the prostate cancer protein or nucleic acid,
e.g., a functional, physical, or chemical effect, such as the ability to decrease prostate cancer. 
It includes ligand binding activity; cell growth on soft agar; anchorage dependence; contact 
inhibition and density limitation of growth; cellular proliferation; cellular transformation; 
growth factor or serum dependence; tumor specific marker levels; invasiveness into Matrigel;
tumor growth and metastasis in vivo; mRNA and protein expression in cells undergoing
metastasis, and other characteristics of prostate cancer cells. "Functional effects" include in
vitro, in vivo, and ex vivo activities. 

By "determining the functional effect" is meant assaying for a compound that 
increases or decreases a parameter that is indirectly or directly under the influence of a
prostate cancer protein sequence, e.g., functional, enzymatic, physical and chemical effects.
Such functional effects can be measured by any means known to those skilled in the art, e.g., 
changes in spectroscopic characteristics (e.g., fluorescence, absorbance, refractive index), 
hydrodynamic (e.g., shape), chromatographic, or solubility properties for the protein, 
measuring inducible markers or transcriptional activation of the prostate cancer protein;
measuring binding activity or binding assays, e.g. binding to antibodies or other ligands, and
measuring cellular proliferation. Determination of the functional effect of a compound on 
prostate cancer can also be performed using prostate cancer assays known to those of skill in 
the art such as in vitro assays, e.g., cell growth on soft agar; anchorage dependence;
contact inhibition and density limitation of growth; cellular proliferation; cellular
transformation; growth factor or serum dependence; tumor specific marker levels;
invasiveness into Matrigel; tumor growth and metastasis in vivo; mRNA and protein
expression in cells undergoing metastasis, and other characteristics of prostate cancer cells.
The functional effects can be evaluated by many means known to those skilled in the art, e.g.,
microscopy for quantitative or qualitative measures of alterations in morphological features,
measurement of changes in RNA or protein levels for prostate cancer-associated sequences,
measurement of RNA stability, identification of downstream or reporter gene expression
(CAT, luciferase, β-gal, GFP and the like), e.g., via chemiluminescence, fluorescence,
colorimetric reactions, antibody binding, inducible markers, and ligand binding assays. 

"Inhibitors", "activators", and "modulators" of prostate cancer polynucleotide 
and polypeptide sequences are used to refer to activating, inhibitory, or modulating molecules
or compounds identified using in vitro and in vivo assays of prostate cancer polynucleotide
and polypeptide sequences. Inhibitors are compounds that, e.g., bind to, partially or totally 
block activity, decrease, prevent, delay activation, inactivate, desensitize, or down regulate 
the activity or expression of prostate cancer proteins, e.g., antagonists. Antisense nucleic 
acids may seem to inhibit expression and subsequent function of the protein. "Activators"
are compounds that increase, open, activate, facilitate, enhance activation, sensitize, agonize,
or up regulate prostate cancer protein activity. Inhibitors, activators, or modulators also 
include genetically modified versions of prostate cancer proteins, e.g., versions with altered 
activity, as well as naturally occurring and synthetic ligands, antagonists, agonists, antibodies, 
small chemical molecules and the like. Such assays for inhibitors and activators include, e.g.,
expressing the prostate cancer protein in vitro, in cells, or cell membranes, applying putative
modulator compounds, and then determining the functional effects on activity, as described 
above. Activators and inhibitors of prostate cancer can also be identified by incubating 
prostate cancer cells with the test compound and determining increases or decreases in the 
expression of 1 or more prostate cancer proteins, e.g., 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50
or more prostate cancer proteins, such as prostate cancer proteins encoded by the sequences
set out in Tables 1-16. 

Samples or assays comprising prostate cancer proteins that are treated with a 
potential activator, inhibitor, or modulator are compared to control samples without the 
inhibitor, activator, or modulator to examine the extent of inhibition. Control samples
(untreated with inhibitors) are assigned a relative protein activity value of 100%. Inhibition
of a polypeptide is achieved when the activity value relative to the control is about 80%,
preferably 50%, more preferably 25-0%. Activation of a prostate cancer polypeptide is
achieved when the activity value relative to the control (untreated with activators) is 110%, 
more preferably 150%, more preferably 200-500% (i.e., two to five fold higher relative to the 
control), more preferably 1000-3000% higher. 

The phrase "changes in cell growth" refers to any change in cell growth and
proliferation characteristics in vitro or in vivo, such as formation of foci, anchorage
independence, semi-solid or soft agar growth, changes in contact inhibition and density 
limitation of growth, loss of growth factor or serum requirements, changes in cell 
morphology, gaining or losing immortalization, gaining or losing tumor specific markers,
ability to form or suppress tumors when injected into suitable animal hosts, and/or
immortalization of the cell. See, e.g., Freshney, Culture of Animal Cells: A Manual of Basic
Technique pp. 231-241 (3rd ed. 1994).

"Tumor cell" refers to precancerous, cancerous, and normal cells in a tumor. 
"Cancer cells," "transformed" cells or "transformation" in tissue culture, refers 

to spontaneous or induced phenotypic changes that do not necessarily involve the uptake of
new genetic material. Although transformation can arise from infection with a transforming 
virus and incorporation of new genomic DNA, or uptake of exogenous DNA, it can also arise 
spontaneously or following exposure to a carcinogen, thereby mutating an endogenous gene. 
Transformation is associated with phenotypic changes, such as immortalization of cells,
aberrant growth control, nonmorphological changes, and/or malignancy (see, Freshney,
Culture of Animal Cells: A Manual of Basic Technique (3rd ed. 1994)).

"Antibody" refers to a polypeptide comprising a framework region from an 
immunoglobulin gene or fragments thereof that specifically binds and recognizes an antigen. 
The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta,
epsilon, and mu constant region genes, as well as the myriad immunoglobulin variable region
genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as 
gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, 
IgM, IgA, IgD and IgE, respectively. Typically, the antigen-binding region of an antibody or 
its functional equivalent will be most critical in specificity and affinity of binding. See Paul,
Fundamental Immunology.

An exemplary immunoglobulin (antibody) structural unit comprises a
tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, each pair
having one "light" (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus
of each chain defines a variable region of about 100 to 110 or more amino acids primarily
responsible for antigen recognition. The terms variable light chain (VL) and variable heavy
chain (VH) refer to these light and heavy chains respectively.

Antibodies exist, e.g., as intact immunoglobulins or as a number of well-characterized
fragments produced by digestion with various peptidases. Thus, e.g., pepsin
digests an antibody below the disulfide linkages in the hinge region to produce F(ab)'2, a
dimer of Fab which itself is a light chain joined to VH-CH1 by a disulfide bond. The F(ab)'2
may be reduced under mild conditions to break the disulfide linkage in the hinge region,
thereby converting the F(ab)'2 dimer into an Fab' monomer. The Fab' monomer is
essentially Fab with part of the hinge region (see Fundamental Immunology (Paul ed., 3d ed.
1993)). While various antibody fragments are defined in terms of the digestion of an intact
antibody, one of skill will appreciate that such fragments may be synthesized de novo either
chemically or by using recombinant DNA methodology. Thus, the term antibody, as used
herein, also includes antibody fragments either produced by the modification of whole
antibodies, or those synthesized de novo using recombinant DNA methodologies (e.g., single
chain Fv) or those identified using phage display libraries (see, e.g., McCafferty et al., Nature
348:552-554 (1990)).

For preparation of antibodies, e.g., recombinant, monoclonal, or polyclonal
antibodies, many techniques known in the art can be used (see, e.g., Kohler & Milstein,
Nature 256:495-497 (1975); Kozbor et al., Immunology Today 4:72 (1983); Cole et al., pp.
77-96 in Monoclonal Antibodies and Cancer Therapy (1985); Coligan, Current Protocols in
Immunology (1991); Harlow & Lane, Antibodies, A Laboratory Manual (1988); and Goding,
Monoclonal Antibodies: Principles and Practice (2d ed. 1986)). Techniques for the
production of single chain antibodies (U.S. Patent 4,946,778) can be adapted to produce
antibodies to polypeptides of this invention. Also, transgenic mice, or other organisms such
as other mammals, may be used to express humanized antibodies. Alternatively, phage
display technology can be used to identify antibodies and heteromeric Fab fragments that
specifically bind to selected antigens (see, e.g., McCafferty et al., Nature 348:552-554
(1990); Marks et al., Biotechnology 10:779-783 (1992)).

A "chimeric antibody" is an antibody molecule in which (a) the constant 
region, or a portion thereof, is altered, replaced or exchanged so that the antigen binding site 
(variable region) is linked to a constant region of a different or altered class, effector function
and/or species, or an entirely different molecule which confers new properties to the chimeric 
antibody, e.g., an enzyme, toxin, hormone, growth factor, drug, etc.; or (b) the variable 
region, or a portion thereof, is altered, replaced or exchanged with a variable region having a 
different or altered antigen specificity. 

Identification of prostate cancer-associated sequences 

In one aspect, the expression levels of genes are determined in different 
patient samples for which diagnosis information is desired, to provide expression profiles. 
An expression profile of a particular sample is essentially a "fingerprint" of the state of the
sample; while two states may have any particular gene similarly expressed, the evaluation of
a number of genes simultaneously allows the generation of a gene expression profile that is
characteristic of the state of the cell. That is, normal tissue (e.g., normal prostate or other
tissue) may be distinguished from cancerous or metastatic cancerous tissue of the prostate, or
prostate cancer tissue or metastatic prostate cancerous tissue can be compared with tissue
samples of prostate and other tissues from surviving cancer patients. By comparing
expression profiles of tissue in known different prostate cancer states, information regarding
which genes are important (including both up- and down-regulation of genes) in each of these 
states is obtained. 

The identification of sequences that are differentially expressed in prostate
cancer versus non-prostate cancer tissue allows the use of this information in a number of
ways. For example, a particular treatment regime may be evaluated: does a chemotherapeutic
drug act to down-regulate prostate cancer, and thus tumor growth or recurrence, in a
particular patient? Similarly, diagnosis and treatment outcomes may be established or confirmed by
comparing patient samples with the known expression profiles. Metastatic tissue can also be
analyzed to determine the stage of prostate cancer in the tissue. Furthermore, these gene
expression profiles (or individual genes) allow screening of drug candidates with an eye to
mimicking or altering a particular expression profile; e.g., screening can be done for drugs
that suppress the prostate cancer expression profile. This may be done by making biochips
comprising sets of the important prostate cancer genes, which can then be used in these
screens. These methods can also be performed at the protein level; that is, protein expression
levels of the prostate cancer proteins can be evaluated for diagnostic purposes or to screen
candidate agents. In addition, the prostate cancer nucleic acid sequences can be administered 
for gene therapy purposes, including the administration of antisense nucleic acids, or the 
prostate cancer proteins (including antibodies and other modulators thereof) administered as 
therapeutic drugs. 

Thus the present invention provides nucleic acid and protein sequences that
are differentially expressed in prostate cancer, herein termed "prostate cancer sequences." As
outlined below, prostate cancer sequences include those that are up-regulated (i.e., expressed
at a higher level) in prostate cancer, as well as those that are down-regulated (i.e., expressed
at a lower level). In a preferred embodiment, the prostate cancer sequences are from humans;
however, as will be appreciated by those in the art, prostate cancer sequences from other
organisms may be useful in animal models of disease and drug evaluation; thus, other
prostate cancer sequences are provided, from vertebrates, including mammals, including
rodents (rats, mice, hamsters, guinea pigs, etc.), primates, farm animals (including sheep,
goats, pigs, cows, horses, etc.) and pets (e.g., dogs, cats, etc.). Prostate cancer sequences
from other organisms may be obtained using the techniques outlined below.

Prostate cancer sequences can include both nucleic acid and amino acid 
sequences. As will be appreciated by those in the art and is more fully outlined below, 
prostate cancer nucleic acid sequences are useful in a variety of applications, including 
diagnostic applications, which will detect naturally occurring nucleic acids, as well as
screening applications; e.g., biochips comprising nucleic acid probes or PCR microtiter plates
with selected probes to the prostate cancer sequences can be generated.

A prostate cancer sequence can be initially identified by substantial nucleic 
acid and/or amino acid sequence homology to the prostate cancer sequences outlined herein. 
Such homology can be based upon the overall nucleic acid or amino acid sequence, and is
generally determined as outlined below, using either homology programs or hybridization
conditions.

For identifying prostate cancer-associated sequences, the prostate cancer
screen typically includes comparing genes identified in different tissues, e.g., normal and
cancerous tissues, or tumor tissue samples from patients who have metastatic disease vs.
non-metastatic tissue. Other suitable tissue comparisons include comparing prostate cancer
samples with metastatic cancer samples from other cancers, such as lung, breast,
gastrointestinal cancers, ovarian, etc. Samples of different stages of prostate cancer, e.g.,
survivor tissue, drug resistant states, and tissue undergoing metastasis, are applied to biochips 
comprising nucleic acid probes. The samples are first microdissected, if applicable, and 
treated as is known in the art for the preparation of mRNA. Suitable biochips are
commercially available, e.g., from Affymetrix. Gene expression profiles as described herein
are generated and the data analyzed. 

In one embodiment, the genes showing changes in expression as between 
normal and disease states are compared to genes expressed in other normal tissues, preferably 
normal prostate, but also including, and not limited to, lung, heart, brain, liver, breast, kidney,
muscle, colon, small intestine, large intestine, spleen, bone and placenta. In a preferred
embodiment, those genes identified during the prostate cancer screen that are expressed in 
any significant amount in other tissues are removed from the profile, although in some 
embodiments, this is not necessary. That is, when screening for drugs, it is usually preferable 
that the target be disease specific, to minimize possible side effects. 
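
The removal of candidate genes that are appreciably expressed in other normal tissues can be sketched as a simple filter. This is a minimal illustration under stated assumptions: the function name, the data layout (gene mapped to per-tissue expression levels), the gene names, and the numeric cutoff for "any significant amount" are all hypothetical and are not specified in the text.

```python
def filter_disease_specific(candidates, normal_expression, max_normal_level=100.0):
    """Keep candidates whose expression stays below a cutoff in every normal tissue.

    candidates: iterable of gene identifiers found in the prostate cancer screen.
    normal_expression: dict mapping gene -> {tissue name: expression level}.
    max_normal_level: assumed cutoff for "significant" expression (hypothetical).
    Genes with no normal-tissue data are kept, since nothing argues against them.
    """
    kept = []
    for gene in candidates:
        levels = normal_expression.get(gene, {})
        if all(level < max_normal_level for level in levels.values()):
            kept.append(gene)
    return kept


# Hypothetical example data, for illustration only.
normal = {
    "GENE_A": {"lung": 12.0, "heart": 8.0, "liver": 30.0},
    "GENE_B": {"lung": 450.0, "heart": 20.0, "liver": 15.0},
}
specific = filter_disease_specific(["GENE_A", "GENE_B", "GENE_C"], normal)
```

Here GENE_B would be dropped because of its high level in normal lung, while GENE_A and the unmeasured GENE_C remain in the profile.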

In a preferred embodiment, prostate cancer sequences are those that are up-regulated
in prostate cancer; that is, the expression of these genes is higher in the prostate
cancer tissue as compared to non-cancerous tissue. "Up-regulation" as used herein often
means at least about a two-fold change, preferably at least about a three-fold change, with at
least about five-fold or higher being preferred. All unigene cluster identification numbers
and accession numbers herein are for the GenBank sequence database and the sequences of
the accession numbers are hereby expressly incorporated by reference. GenBank is known in 
the art, see, e.g., Benson, DA, et al, Nucleic Acids Research 26:1-7 (1998) and 
http://www.ncbi.nlm.nih.gov/. Sequences are also available in other databases, e.g., 
European Molecular Biology Laboratory (EMBL) and DNA Database of Japan (DDBJ). 

In another preferred embodiment, prostate cancer sequences are those that are
down-regulated in prostate cancer; that is, the expression of these genes is lower in prostate
cancer tissue as compared to non-cancerous tissue (see, e.g., Tables 8, 12 and 14). "Down-regulation"
as used herein often means at least about a 1.5-fold change, more preferably a
two-fold change, preferably at least about a three-fold change, with at least about five-fold or
higher being most preferred.
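
The fold-change definitions above can be illustrated with a short Python sketch. The function names are hypothetical; the default cutoffs reuse the minimum values given in the text (about two-fold for up-regulation, about 1.5-fold for down-regulation), and the inputs are assumed to be positive expression measurements on a linear scale.

```python
def fold_change(tumor, normal):
    """Ratio of expression in prostate cancer tissue to non-cancerous tissue."""
    if tumor <= 0 or normal <= 0:
        raise ValueError("expression values must be positive")
    return tumor / normal


def classify_regulation(tumor, normal, up_cutoff=2.0, down_cutoff=1.5):
    """Label a gene using the minimum fold changes stated in the text.

    A gene is "up-regulated" if tumor/normal >= up_cutoff and
    "down-regulated" if normal/tumor >= down_cutoff.
    """
    ratio = fold_change(tumor, normal)
    if ratio >= up_cutoff:
        return "up-regulated"
    if normal / tumor >= down_cutoff:
        return "down-regulated"
    return "unchanged"
```

For instance, a gene measured at 600 units in tumor and 200 units in normal tissue shows a three-fold increase and would be labeled up-regulated under these assumed cutoffs.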


Informatics 

The ability to identify genes that are over or under expressed in prostate 
cancer can additionally provide high-resolution, high-sensitivity datasets which can be used
in the areas of diagnostics, therapeutics, drug development, pharmacogenetics, protein
structure, biosensor development, and other related areas. For example, the expression 
profiles can be used in diagnostic or prognostic evaluation of patients with prostate cancer. 
Or as another example, subcellular toxicological information can be generated to better direct 
drug structure and activity correlation (see Anderson, Pharmaceutical Proteomics: Targets,
Mechanism, and Function, paper presented at the IBC Proteomics conference, Coronado, CA
(June 11-12, 1998)). Subcellular toxicological information can also be utilized in a biological 
sensor device to predict the likely toxicological effect of chemical exposures and likely 
tolerable exposure thresholds (see U.S. Patent No. 5,811,231). Similar advantages accrue 
from datasets relevant to other biomolecules and bioactive agents (e.g., nucleic acids,
saccharides, lipids, drugs, and the like).

Thus, in another embodiment, the present invention provides a database that 
includes at least one set of assay data. The data contained in the database is acquired, e.g., 
using array analysis either singly or in a library format. The database can be in substantially 
any form in which data can be maintained and transmitted, but is preferably an electronic
database. The electronic database of the invention can be maintained on any electronic
device allowing for the storage of and access to the database, such as a personal computer, 
but is preferably distributed on a wide area network, such as the World Wide Web. 

The focus of the present section on databases that include peptide sequence 
data is for clarity of illustration only. It will be apparent to those of skill in the art that similar
databases can be assembled for any assay data acquired using an assay of the invention.

The compositions and methods for identifying and/or quantitating the relative
and/or absolute abundance of a variety of molecular and macromolecular species from a 
biological sample undergoing prostate cancer, i.e., the identification of prostate cancer- 
associated sequences described herein, provide an abundance of information, which can be 
correlated with pathological conditions, predisposition to disease, drug testing, therapeutic
monitoring, gene-disease causal linkages, identification of correlates of immunity and 
physiological status, among others. Although the data generated from the assays of the 
invention is suited for manual review and analysis, in a preferred embodiment, prior data 
processing using high-speed computers is utilized. 

An array of methods for indexing and retrieving biomolecular information is
known in the art. For example, U.S. Patents 6,023,659 and 5,966,712 disclose a relational
database system for storing biomolecular sequence information in a manner that allows 
sequences to be catalogued and searched according to one or more protein function 
hierarchies. U.S. Patent 5,953,727 discloses a relational database having sequence records
containing information in a format that allows a collection of partial-length DNA sequences
to be catalogued and searched according to association with one or more sequencing projects 
for obtaining full-length sequences from the collection of partial length sequences. U.S. 
Patent 5,706,498 discloses a gene database retrieval system for making a retrieval of a gene 
sequence similar to a sequence data item in a gene database based on the degree of similarity
between a key sequence and a target sequence. U.S. Patent 5,538,897 discloses a method
using mass spectroscopy fragmentation patterns of peptides to identify amino acid sequences
in computer databases by comparison of predicted mass spectra with experimentally-derived 
mass spectra using a closeness-of-fit measure. U.S. Patent 5,926,818 discloses a
multi-dimensional database comprising a functionality for multi-dimensional data analysis
described as on-line analytical processing (OLAP), which entails the consolidation of
projected and actual data according to more than one consolidation path or dimension. U.S.
Patent 5,295,261 reports a hybrid database structure in which the fields of each database 
record are divided into two classes, navigational and informational data, with navigational 
fields stored in a hierarchical topological map which can be viewed as a tree structure or as
the merger of two or more such tree structures.

See also Mount et al., Bioinformatics (2001); Biological Sequence Analysis:
Probabilistic Models of Proteins and Nucleic Acids (Durbin et al., eds., 1999);
Bioinformatics: A Practical Guide to the Analysis of Genes and Proteins (Baxevanis &
Ouellette eds., 1998); Rashidi & Buehler, Bioinformatics: Basic Applications in Biological
Science and Medicine (1999); Introduction to Computational Molecular Biology (Setubal et
al., eds., 1997); Bioinformatics: Methods and Protocols (Misener & Krawetz, eds., 2000);
Bioinformatics: Sequence, Structure, and Databanks: A Practical Approach (Higgins &
Taylor, eds., 2000); Brown, Bioinformatics: A Biologist's Guide to Biocomputing and the
Internet (2001); Han & Kamber, Data Mining: Concepts and Techniques (2000); and
Waterman, Introduction to Computational Biology: Maps, Sequences, and Genomes (1995).

The present invention provides a computer database comprising a computer 
and software for storing in computer-retrievable form assay data records cross-tabulated, e.g., 
with data specifying the source of the target-containing sample from which each sequence 
specificity record was obtained. 

In an exemplary embodiment, at least one of the sources of target-containing
sample is from a control tissue sample known to be free of pathological disorders. In a
variation, at least one of the sources is a known pathological tissue specimen, e.g., a
neoplastic lesion or another tissue specimen to be analyzed for prostate cancer. In another
variation, the assay records cross-tabulate one or more of the following parameters for each
target species in a sample: (1) a unique identification code, which can include, e.g., a target
molecular structure and/or characteristic separation coordinate (e.g., electrophoretic 
coordinates); (2) sample source; and (3) absolute and/or relative quantity of the target species 
present in the sample. 
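
One possible in-memory representation of such cross-tabulated assay records is sketched below. The class name, field names, and example identifiers are hypothetical; the three fields simply mirror parameters (1) to (3) listed above and are not a schema prescribed by the text.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class AssayRecord:
    target_id: str      # (1) unique identification code for the target species
    sample_source: str  # (2) source of the target-containing sample
    quantity: float     # (3) absolute or relative quantity in the sample


def cross_tabulate(records):
    """Build a target-by-source table: {target_id: {sample_source: quantity}}."""
    table = defaultdict(dict)
    for record in records:
        table[record.target_id][record.sample_source] = record.quantity
    return {target: dict(by_source) for target, by_source in table.items()}


# Hypothetical records, for illustration only.
records = [
    AssayRecord("TARGET_001", "normal prostate", 1.0),
    AssayRecord("TARGET_001", "prostate tumor", 4.2),
    AssayRecord("TARGET_002", "prostate tumor", 0.3),
]
table = cross_tabulate(records)
```

Looking up table["TARGET_001"] then returns the quantities of that target in each sample source, which is the kind of retrieval the cross-tabulated records are meant to support.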

The invention also provides for the storage and retrieval of a collection of
target data in a computer data storage apparatus, which can include magnetic disks, optical
disks, magneto-optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM,
magnetic bubble memory devices, and other data storage devices, including CPU registers
and on-CPU data storage arrays. Typically, the target data records are stored as a bit pattern
in an array of magnetic domains on a magnetizable medium or as an array of charge states or
transistor gate states, such as an array of cells in a DRAM device (e.g., each cell comprised of
a transistor and a charge storage area, which may be on the transistor). In one embodiment,
the invention provides such storage devices, and computer systems built therewith,
comprising a bit pattern encoding a protein expression fingerprint record comprising unique 
identifiers for at least 10 target data records cross-tabulated with target source. 

When the target is a peptide or nucleic acid, the invention preferably provides 
a method for identifying related peptide or nucleic acid sequences, comprising performing a
computerized comparison between a peptide or nucleic acid sequence assay record stored in 
or retrieved from a computer storage device or database and at least one other sequence. The 
comparison can include a sequence analysis or comparison algorithm or computer program 
embodiment thereof (e.g., FASTA, TFASTA, GAP, BESTFIT) and/or the comparison may 
be of the relative amount of a peptide or nucleic acid sequence in a pool of sequences
determined from a polypeptide or nucleic acid sample of a specimen. 

The invention also preferably provides a magnetic disk, such as an IBM-compatible
(DOS, Windows, Windows95/98/2000, Windows NT, OS/2) or other format
(e.g., Linux, SunOS, Solaris, AIX, SCO Unix, VMS, MV, Macintosh, etc.) floppy diskette or
hard (fixed, Winchester) disk drive, comprising a bit pattern encoding data from an assay of
the invention in a file format suitable for retrieval and processing in a computerized sequence 
analysis, comparison, or relative quantitation method. 

The invention also provides a network, comprising a plurality of computing 
devices linked via a data link, such as an Ethernet cable (coax or 10BaseT), telephone line,
ISDN line, wireless network, optical fiber, or other suitable signal transmission medium,
whereby at least one network device (e.g., computer, disk array, etc.) comprises a pattern of 
magnetic domains (e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM 
cells) composing a bit pattern encoding data acquired from an assay of the invention. 

The invention also provides a method for transmitting assay data that includes 
generating an electronic signal on an electronic communications device, such as a modem,
ISDN terminal adapter, DSL, cable modem, ATM switch, or the like, wherein the signal 
includes (in native or encrypted format) a bit pattern encoding data from an assay or a 
database comprising a plurality of assay results obtained by the method of the invention. 

In a preferred embodiment, the invention provides a computer system for 
comparing a query target to a database containing an array of data structures, such as an assay
result obtained by the method of the invention, and ranking database targets based on the
degree of identity and gap weight to the target data. A central processor is preferably
initialized to load and execute the computer program for alignment and/or comparison of the 
assay results. Data for a query target is entered into the central processor via an I/O device. 
Execution of the computer program results in the central processor retrieving the assay data 
from the data file, which comprises a binary description of an assay result. 

The target data or record and the computer program can be transferred to 
secondary memory, which is typically random access memory (e.g., DRAM, SRAM, 
SGRAM, or SDRAM). Targets are ranked according to the degree of correspondence 
between a selected assay characteristic (e.g., binding to a selected affinity moiety) and the 
same characteristic of the query target and results are output via an I/O device. For example, 
a central processor can be a conventional computer (e.g., Intel Pentium, PowerPC, Alpha, 
PA-8000, SPARC, MIPS 4400, MIPS 10000, VAX, etc.); a program can be a commercial or 
public domain molecular biology software package (e.g., UWGCG Sequence Analysis 
Software, Darwin); a data file can be an optical or magnetic disk, a data server, a memory 
device (e.g., DRAM, SRAM, SGRAM, SDRAM, EPROM, bubble memory, flash memory, 
etc.); an I/O device can be a terminal comprising a video display and a keyboard, a modem,
an ISDN terminal adapter, an Ethernet port, a punched card reader, a magnetic strip reader, or 
other suitable I/O device. 

The invention also preferably provides the use of a computer system, such as 
that described above, which comprises: (1) a computer; (2) a stored bit pattern encoding a 
collection of peptide sequence specificity records obtained by the methods of the invention, 
which may be stored in the computer; (3) a comparison target, such as a query target; and (4) 
a program for alignment and comparison, typically with rank-ordering of comparison results 
on the basis of computed similarity values.
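
The rank-ordering of database targets against a query by a computed similarity value can be sketched as follows. This is an illustrative stand-in, not the FASTA, GAP, or BESTFIT programs named above: it uses a basic Needleman-Wunsch global alignment score with a linear gap penalty, and the function names and scoring parameters (match, mismatch, gap) are assumptions chosen for the example.

```python
def global_alignment_score(a, b, match=1, mismatch=-1, gap=-2):
    """Needleman-Wunsch global alignment score with a linear gap penalty.

    Only the score is computed (no traceback), using two rows of the
    dynamic-programming matrix.
    """
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            curr.append(max(diag, prev[j] + gap, curr[j - 1] + gap))
        prev = curr
    return prev[-1]


def rank_targets(query, database):
    """Return (name, score) pairs sorted from most to least similar to the query.

    database: dict mapping a target name to its sequence string.
    """
    scored = [(name, global_alignment_score(query, seq)) for name, seq in database.items()]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Hypothetical sequences, for illustration only.
ranking = rank_targets("ACGT", {"t1": "ACT", "t2": "ACGT", "t3": "TTTT"})
```

In this toy example the identical sequence ranks first, the sequence requiring one gap ranks second, and the sequence with three mismatches ranks last.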

Characteristics of prostate cancer-associated proteins 

Prostate cancer proteins of the present invention may be classified as secreted 
proteins, transmembrane proteins or intracellular proteins. In one embodiment, the prostate 
cancer protein is an intracellular protein. Intracellular proteins may be found in the 
cytoplasm and/or in the nucleus. Intracellular proteins are involved in all aspects of cellular 
function and replication (including, e.g., signaling pathways); aberrant expression of such
proteins often results in unregulated or dysregulated cellular processes (see, e.g., Molecular
Biology of the Cell (Alberts, ed., 3rd ed., 1994)). For example, many intracellular proteins
have enzymatic activity such as protein kinase activity, protein phosphatase activity, protease
activity, nucleotide cyclase activity, polymerase activity and the like. Intracellular proteins
also serve as docking proteins that are involved in organizing complexes of proteins, or
targeting proteins to various subcellular localizations, and are involved in maintaining the 
structural integrity of organelles. 

An increasingly appreciated concept in characterizing proteins is the presence 
in the proteins of one or more motifs for which defined functions have been attributed. In
addition to the highly conserved sequences found in the enzymatic domain of proteins, highly
conserved sequences have been identified in proteins that are involved in protein-protein 
interaction. For example, Src-homology-2 (SH2) domains bind tyrosine-phosphorylated 
targets in a sequence dependent manner. PTB domains, which are distinct from SH2 
domains, also bind tyrosine-phosphorylated targets. SH3 domains bind to proline-rich
targets. In addition, PH domains, tetratricopeptide repeats and WD domains, to name only a
few, have been shown to mediate protein-protein interactions. Some of these may also be 
involved in binding to phospholipids or other second messengers. As will be appreciated by 
one of ordinary skill in the art, these motifs can be identified on the basis of primary 
sequence; thus, an analysis of the sequence of proteins may provide insight into both the
enzymatic potential of the molecule and/or molecules with which the protein may associate.
One useful database is Pfam (protein families), which is a large collection of multiple 
sequence alignments and hidden Markov models covering many common protein domains. 
Versions are available via the internet from Washington University in St. Louis, the Sanger 
Center in England, and the Karolinska Institute in Sweden (see, e.g., Bateman et al., Nuc.
Acids Res. 28:263-266 (2000); Sonnhammer et al., Proteins 28:405-420 (1997); Bateman et
al., Nuc. Acids Res. 27:260-262 (1999); and Sonnhammer et al., Nuc. Acids Res. 26:320-322
(1998)).

In another embodiment, the prostate cancer sequences are transmembrane
proteins. Transmembrane proteins are molecules that span a phospholipid bilayer of a cell.
They may have an intracellular domain, an extracellular domain, or both. The intracellular
domains of such proteins may have a number of functions including those already described
for intracellular proteins. For example, the intracellular domain may have enzymatic activity
and/or may serve as a binding site for additional proteins. Frequently the intracellular
domain of transmembrane proteins serves both roles. For example, certain receptor tyrosine
kinases have both protein kinase activity and SH2 domains. In addition, autophosphorylation
of tyrosines on the receptor molecule itself creates binding sites for additional SH2
domain-containing proteins.

Transmembrane proteins may contain from one to many transmembrane 
domains. For example, receptor tyrosine kinases, certain cytokine receptors, receptor 
guanylyl cyclases and receptor serine/threonine protein kinases contain a single
transmembrane domain. However, various other proteins including channels and adenylyl
cyclases contain numerous transmembrane domains. Many important cell surface receptors 
such as G protein coupled receptors (GPCRs) are classified as "seven transmembrane 
domain" proteins, as they contain 7 membrane spanning regions. Characteristics of 
transmembrane domains include approximately 20 consecutive hydrophobic amino acids that 

15 may be followed by charged amino acids. Therefore, upon analysis of the amino acid 
sequence of a particular protein, the localization and number of transmembrane domains 
within the protein may be predicted {see, e.g. PSORT web site http://psort.nibb.ac.jp/). 
Important transmembrane protein receptors include, but are not limited to the insulin 
receptor, insulin-like growth factor receptor, human growth hormone receptor, glucose 

20 transporters, transferrin receptor, epidermal growth factor receptor, low density lipoprotein 
receptor, epidermal growth factor receptor, leptin receptor, interleukin receptors, e.g. IL-1 
receptor, IL-2 receptor, 
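
The sequence-based prediction described above (a run of roughly 20 consecutive hydrophobic residues) can be illustrated with a minimal sliding-window sketch. This is not the PSORT algorithm; it assumes the widely used Kyte-Doolittle hydropathy scale, and the function name, the 20-residue window and the 1.6 cutoff are illustrative assumptions rather than values taken from this specification.

```python
# Kyte-Doolittle hydropathy values (assumed scale; not specified in the text).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def candidate_tm_segments(seq, window=20, threshold=1.6):
    """Return merged (start, end) spans, 0-based and end-exclusive, covering
    every window whose mean hydropathy is at least `threshold`.

    `window` and `threshold` are hypothetical defaults chosen for illustration.
    """
    seq = seq.upper()
    segments = []
    for i in range(len(seq) - window + 1):
        mean = sum(KD[aa] for aa in seq[i:i + window]) / window
        if mean >= threshold:
            if segments and i <= segments[-1][1]:
                # Overlapping or adjacent hydrophobic window: extend the span.
                segments[-1] = (segments[-1][0], i + window)
            else:
                segments.append((i, i + window))
    return segments
```

Each returned span marks a candidate membrane-spanning region; counting the spans gives a rough estimate of the number of transmembrane domains, which would still need confirmation by a dedicated predictor or by experiment.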

The extracellular domains of transmembrane proteins are diverse; however,
conserved motifs are found repeatedly among various extracellular domains. Conserved
structure and/or functions have been ascribed to different extracellular motifs. Many
extracellular domains are involved in binding to other molecules. In one aspect, extracellular
domains are found on receptors. Factors that bind the receptor domain include circulating
ligands, which may be peptides, proteins, or small molecules such as adenosine and the like.
For example, growth factors such as EGF, FGF and PDGF are circulating growth factors that
bind to their cognate receptors to initiate a variety of cellular responses. Other factors include
cytokines, mitogenic factors, neurotrophic factors and the like. Extracellular domains also
bind to cell-associated molecules. In this respect, they mediate cell-cell interactions.
Cell-associated ligands can be tethered to the cell, e.g., via a glycosylphosphatidylinositol (GPI)
anchor, or may themselves be transmembrane proteins. Extracellular domains also associate
with the extracellular matrix and contribute to the maintenance of the cell structure.

Prostate cancer proteins that are transmembrane are particularly preferred in
the present invention as they are readily accessible targets for immunotherapeutics, as
described herein. In addition, as outlined below, transmembrane proteins can also be useful
in imaging modalities. Antibodies may be used to label such readily accessible proteins in
situ. Alternatively, antibodies can also label intracellular proteins, in which case samples are
typically permeabilized to provide access to intracellular proteins.

It will also be appreciated by those in the art that a transmembrane protein can 
be made soluble by removing transmembrane sequences, e.g., through recombinant methods. 
Furthermore, transmembrane proteins that have been made soluble can be made to be 
secreted through recombinant means by adding an appropriate signal sequence. 

In another embodiment, the prostate cancer proteins are secreted proteins, the
secretion of which can be either constitutive or regulated. These proteins have a signal
peptide or signal sequence that targets the molecule to the secretory pathway. Secreted
proteins are involved in numerous physiological events; by virtue of their circulating nature,
they serve to transmit signals to various other cell types. The secreted protein may function in
an autocrine manner (acting on the cell that secreted the factor), a paracrine manner (acting
on cells in close proximity to the cell that secreted the factor) or an endocrine manner (acting
on cells at a distance). Thus secreted molecules find use in modulating or altering numerous
aspects of physiology. Prostate cancer proteins that are secreted proteins are particularly
preferred in the present invention as they serve as good targets for diagnostic markers, e.g.,
for blood, plasma, serum, or stool tests.

Use of prostate cancer nucleic acids 

As described above, a prostate cancer sequence is initially identified by
substantial nucleic acid and/or amino acid sequence homology or linkage to the prostate
cancer sequences outlined herein. Such homology can be based upon the overall nucleic acid
or amino acid sequence, and is generally determined as outlined below, using either
homology programs or hybridization conditions. Typically, linked sequences on an mRNA are
found on the same molecule.

The prostate cancer nucleic acid sequences of the invention, e.g., the
sequences in Tables 1-16, can be fragments of larger genes, i.e., they are nucleic acid
segments. "Genes" in this context includes coding regions, non-coding regions, and mixtures
of coding and non-coding regions. Accordingly, as will be appreciated by those in the art,
using the sequences provided herein, extended sequences, in either direction, of the prostate
cancer genes can be obtained, using techniques well known in the art for cloning either longer
sequences or the full length sequences; see Ausubel et al., supra. Much can be done by
informatics and many sequences can be clustered to include multiple sequences
corresponding to a single gene, e.g., systems such as UniGene (see,
http://www.ncbi.nlm.nih.gov/UniGene/).

Once the prostate cancer nucleic acid is identified, it can be cloned and, if
necessary, its constituent parts recombined to form the entire prostate cancer nucleic acid
coding regions or the entire mRNA sequence. Once isolated from its natural source, e.g.,
contained within a plasmid or other vector or excised therefrom as a linear nucleic acid
segment, the recombinant prostate cancer nucleic acid can be further used as a probe to
identify and isolate other prostate cancer nucleic acids, e.g., extended coding regions. It can
also be used as a "precursor" nucleic acid to make modified or variant prostate cancer nucleic
acids and proteins.

The prostate cancer nucleic acids of the present invention are used in several
ways. In a first embodiment, nucleic acid probes to the prostate cancer nucleic acids are
made and attached to biochips to be used in screening and diagnostic methods, as outlined
below, or for administration, e.g., for gene therapy, vaccine, and/or antisense applications.
Alternatively, the prostate cancer nucleic acids that include coding regions of prostate cancer
proteins can be put into expression vectors for the expression of prostate cancer proteins,
again for screening purposes or for administration to a patient.

In a preferred embodiment, nucleic acid probes to prostate cancer nucleic
acids (both the nucleic acid sequences outlined in the figures and/or the complements thereof)
are made. The nucleic acid probes attached to the biochip are designed to be substantially
complementary to the prostate cancer nucleic acids, i.e., the target sequence (either the target
sequence of the sample or to other probe sequences, e.g., in sandwich assays), such that
hybridization of the target sequence and the probes of the present invention occurs. As
outlined below, this complementarity need not be perfect; there may be any number of base
pair mismatches which will interfere with hybridization between the target sequence and the
single stranded nucleic acids of the present invention. However, if the number of mutations
is so great that no hybridization can occur under even the least stringent of hybridization
conditions, the sequence is not a complementary target sequence. Thus, by "substantially
complementary" herein is meant that the probes are sufficiently complementary to the target
sequences to hybridize under normal reaction conditions, particularly high stringency
conditions, as outlined herein.
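
As a rough illustration of imperfect complementarity, the sketch below counts base-pair mismatches between a probe and the site it is meant to anneal to. It is a sequence-level approximation only: real hybridization depends on stringency, temperature and salt, and the `max_mismatches` cutoff is a hypothetical stand-in for those conditions, not a value from this specification.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def mismatches(probe, target_site):
    """Count mismatched positions when `probe` anneals antiparallel to
    `target_site`; both must have the same length."""
    if len(probe) != len(target_site):
        raise ValueError("probe and target site must be the same length")
    perfect = reverse_complement(target_site)
    return sum(p != q for p, q in zip(probe.upper(), perfect))


def is_substantially_complementary(probe, target_site, max_mismatches=2):
    """Hypothetical test: tolerate up to `max_mismatches` mismatched pairs."""
    return mismatches(probe, target_site) <= max_mismatches
```

For example, `mismatches(reverse_complement(site), site)` is 0 for any site, and each altered probe base raises the count by one.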

A nucleic acid probe is generally single stranded but can be partially single
and partially double stranded. The strandedness of the probe is dictated by the structure,
composition, and properties of the target sequence. In general, the nucleic acid probes range
from about 8 to about 100 bases long, with from about 10 to about 80 bases being preferred,
and from about 30 to about 50 bases being particularly preferred. That is, generally whole
genes are not used. In some embodiments, much longer nucleic acids can be used, up to
hundreds of bases.

In a preferred embodiment, more than one probe per sequence is used, with
either overlapping probes or probes to different sections of the target being used. That is,
two, three, four or more probes, with three being preferred, are used to build in a redundancy
for a particular target. The probes can be overlapping (i.e., have some sequence in common),
or separate. In some cases, PCR primers may be used to amplify signal for higher sensitivity.
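
The redundancy scheme above can be sketched as a simple tiling routine that spreads a chosen number of probe windows evenly along a target. The function name and the defaults (three windows of 40 bases, within the preferred 30 to 50 base range) are illustrative assumptions; the windows are returned in the target's own sense, so an actual probe would be the complement of each window.

```python
def tile_probes(target, probe_len=40, n_probes=3):
    """Return `n_probes` (start, window) pairs spaced evenly across `target`.

    Windows overlap when the target is shorter than n_probes * probe_len
    and are separate otherwise.
    """
    if n_probes < 1:
        raise ValueError("n_probes must be at least 1")
    if probe_len > len(target):
        raise ValueError("probe_len exceeds target length")
    if n_probes == 1:
        starts = [0]
    else:
        span = len(target) - probe_len
        starts = [round(k * span / (n_probes - 1)) for k in range(n_probes)]
    return [(s, target[s:s + probe_len]) for s in starts]
```

With a 100-base target and the defaults, the windows start at positions 0, 30 and 60, so neighbouring windows share 10 bases.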

As will be appreciated by those in the art, nucleic acids can be attached or
immobilized to a solid support in a wide variety of ways. By "immobilized" and grammatical
equivalents herein is meant the association or binding between the nucleic acid probe and the
solid support is sufficient to be stable under the conditions of binding, washing, analysis, and
removal as outlined below. The binding can typically be covalent or non-covalent. By
"non-covalent binding" and grammatical equivalents herein is meant one or more of electrostatic,
hydrophilic, and hydrophobic interactions. Included in non-covalent binding is the covalent
attachment of a molecule, such as streptavidin, to the support and the non-covalent binding of
the biotinylated probe to the streptavidin. By "covalent binding" and grammatical
equivalents herein is meant that the two moieties, the solid support and the probe, are
attached by at least one bond, including sigma bonds, pi bonds and coordination bonds.
Covalent bonds can be formed directly between the probe and the solid support or can be
formed by a cross linker or by inclusion of a specific reactive group on either the solid
support or the probe or both molecules. Immobilization may also involve a combination of
covalent and non-covalent interactions.

In general, the probes are attached to the biochip in a wide variety of ways, as
will be appreciated by those in the art. As described herein, the nucleic acids can either be
synthesized first, with subsequent attachment to the biochip, or can be directly synthesized on
the biochip.

The biochip comprises a suitable solid substrate. By "substrate" or "solid
support" or other grammatical equivalents herein is meant a material that can be modified to
contain discrete individual sites appropriate for the attachment or association of the nucleic
acid probes and is amenable to at least one detection method. As will be appreciated by those
in the art, the number of possible substrates is very large, and includes, but is not limited to,
glass and modified or functionalized glass, plastics (including acrylics, polystyrene and
copolymers of styrene and other materials, polypropylene, polyethylene, polybutylene,
polyurethanes, Teflon™, etc.), polysaccharides, nylon or nitrocellulose, resins, silica or
silica-based materials including silicon and modified silicon, carbon, metals, inorganic glasses,
etc. In general, the substrates allow optical detection and do not appreciably
fluoresce. A preferred substrate is described in copending application entitled Reusable Low
Fluorescent Plastic Biochip, U.S. Application Serial No. 09/270,214, filed March 15, 1999,
herein incorporated by reference in its entirety.

Generally the substrate is planar, although as will be appreciated by those in
the art, other configurations of substrates may be used as well. For example, the probes may
be placed on the inside surface of a tube, for flow-through sample analysis to minimize
sample volume. Similarly, the substrate may be flexible, such as a flexible foam, including
closed cell foams made of particular plastics.

In a preferred embodiment, the surface of the biochip and the probe may be
derivatized with chemical functional groups for subsequent attachment of the two. Thus, e.g.,
the biochip is derivatized with a chemical functional group including, but not limited to,
amino groups, carboxy groups, oxo groups and thiol groups, with amino groups being
particularly preferred. Using these functional groups, the probes can be attached using
functional groups on the probes. For example, nucleic acids containing amino groups can be
attached to surfaces comprising amino groups, e.g., using linkers as are known in the art; e.g.,
homo- or hetero-bifunctional linkers as are well known (see 1994 Pierce Chemical Company
catalog, technical section on cross-linkers, pages 155-200). In addition, in some cases,
additional linkers, such as alkyl groups (including substituted and heteroalkyl groups) may be
used.

In this embodiment, oligonucleotides are synthesized as is known in the art,
and then attached to the surface of the solid support. As will be appreciated by those skilled
in the art, either the 5' or 3' terminus may be attached to the solid support, or attachment may
be via an internal nucleoside.

In another embodiment, the immobilization to the solid support may be very
strong, yet non-covalent. For example, biotinylated oligonucleotides can be made, which
bind to surfaces covalently coated with streptavidin, resulting in attachment.

Alternatively, the oligonucleotides may be synthesized on the surface, as is
known in the art. For example, photoactivation techniques utilizing photopolymerization
compounds and techniques are used. In a preferred embodiment, the nucleic acids can be
synthesized in situ, using well known photolithographic techniques, such as those described
in WO 95/25116; WO 95/35505; U.S. Patent Nos. 5,700,637 and 5,445,934; and references
cited within, all of which are expressly incorporated by reference; these methods of
attachment form the basis of the Affymetrix GeneChip™ technology.

Often, amplification-based assays are performed to measure the expression
level of prostate cancer-associated sequences. These assays are typically performed in
conjunction with reverse transcription. In such assays, a prostate cancer-associated nucleic
acid sequence acts as a template in an amplification reaction (e.g., Polymerase Chain
Reaction, or PCR). In a quantitative amplification, the amount of amplification product will
be proportional to the amount of template in the original sample. Comparison to appropriate
controls provides a measure of the amount of prostate cancer-associated RNA. Methods of
quantitative amplification are well known to those of skill in the art. Detailed protocols for
quantitative PCR are provided, e.g., in Innis et al., PCR Protocols, A Guide to Methods and
Applications (1990).
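
One common way to turn the "comparison to appropriate controls" into a number is relative quantification from threshold-cycle (Ct) values. The sketch below uses the widely known comparative Ct (2^-ΔΔCt) calculation, which is not described in this specification and is shown only as an illustration; it assumes a reference transcript measured in both samples and, by default, perfect doubling per cycle. All parameter names are hypothetical.

```python
def relative_expression(ct_target_sample, ct_ref_sample,
                        ct_target_control, ct_ref_control,
                        efficiency=2.0):
    """Fold change of a target transcript in a sample versus a control,
    normalized to a reference transcript (comparative Ct approach).

    `efficiency` is the assumed per-cycle amplification factor (2.0 means
    the product doubles each cycle).
    """
    delta_sample = ct_target_sample - ct_ref_sample
    delta_control = ct_target_control - ct_ref_control
    return efficiency ** -(delta_sample - delta_control)
```

For instance, a target that reaches threshold three cycles earlier in tumor tissue than in normal tissue, with identical reference Ct values, corresponds to an eight-fold higher estimated template amount under the doubling assumption.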

In some embodiments, a TaqMan based assay is used to measure expression.
TaqMan based assays use a fluorogenic oligonucleotide probe that contains a 5' fluorescent
dye and a 3' quenching agent. The probe hybridizes to a PCR product, but cannot itself be
extended due to a blocking agent at the 3' end. When the PCR product is amplified in
subsequent cycles, the 5' nuclease activity of the polymerase, e.g., AmpliTaq, results in the
cleavage of the TaqMan probe. This cleavage separates the 5' fluorescent dye and the 3'
quenching agent, thereby resulting in an increase in fluorescence as a function of
amplification (see, e.g., literature provided by Perkin-Elmer, e.g., www2.perkin-elmer.com).

Other suitable amplification methods include, but are not limited to, ligase
chain reaction (LCR) (see Wu & Wallace, Genomics 4:560 (1989), Landegren et al., Science
241:1077 (1988), and Barringer et al., Gene 89:117 (1990)), transcription amplification
(Kwoh et al., Proc. Natl. Acad. Sci. USA 86:1173 (1989)), self-sustained sequence replication
(Guatelli et al., Proc. Natl. Acad. Sci. USA 87:1874 (1990)), dot PCR, and linker adapter PCR,
etc.

Expression of prostate cancer proteins from nucleic acids 

In a preferred embodiment, prostate cancer nucleic acids, e.g., those encoding
prostate cancer proteins, are used to make a variety of expression vectors to express prostate
cancer proteins which can then be used in screening assays, as described below. Expression
vectors and recombinant DNA technology are well known to those of skill in the art (see,
e.g., Ausubel, supra, and Gene Expression Systems (Fernandez & Hoeffler, eds, 1999)) and
are used to express proteins. The expression vectors may be either self-replicating
extrachromosomal vectors or vectors which integrate into a host genome. Generally, these
expression vectors include transcriptional and translational regulatory nucleic acid operably
linked to the nucleic acid encoding the prostate cancer protein. The term "control sequences"
refers to DNA sequences used for the expression of an operably linked coding sequence in a
particular host organism. Control sequences that are suitable for prokaryotes, e.g., include a
promoter, optionally an operator sequence, and a ribosome binding site. Eukaryotic cells are
known to utilize promoters, polyadenylation signals, and enhancers.

Nucleic acid is "operably linked" when it is placed into a functional
relationship with another nucleic acid sequence. For example, DNA for a presequence or
secretory leader is operably linked to DNA for a polypeptide if it is expressed as a preprotein
that participates in the secretion of the polypeptide; a promoter or enhancer is operably linked
to a coding sequence if it affects the transcription of the sequence; or a ribosome binding site
is operably linked to a coding sequence if it is positioned so as to facilitate translation.
Generally, "operably linked" means that the DNA sequences being linked are contiguous,
and, in the case of a secretory leader, contiguous and in reading phase. However, enhancers
do not have to be contiguous. Linking is typically accomplished by ligation at convenient
restriction sites. If such sites do not exist, synthetic oligonucleotide adaptors or linkers are
used in accordance with conventional practice. Transcriptional and translational regulatory
nucleic acid will generally be appropriate to the host cell used to express the prostate cancer
protein. Numerous types of appropriate expression vectors, and suitable regulatory
sequences are known in the art for a variety of host cells.

In general, transcriptional and translational regulatory sequences may include,
but are not limited to, promoter sequences, ribosomal binding sites, transcriptional start and
stop sequences, translational start and stop sequences, and enhancer or activator sequences.
In a preferred embodiment, the regulatory sequences include a promoter and transcriptional
start and stop sequences.

Promoter sequences encode either constitutive or inducible promoters. The
promoters may be either naturally occurring promoters or hybrid promoters. Hybrid
promoters, which combine elements of more than one promoter, are also known in the art,
and are useful in the present invention.

In addition, an expression vector may comprise additional elements. For
example, the expression vector may have two replication systems, thus allowing it to be
maintained in two organisms, e.g., in mammalian or insect cells for expression and in a
prokaryotic host for cloning and amplification. Furthermore, for integrating expression
vectors, the expression vector contains at least one sequence homologous to the host cell
genome, and preferably two homologous sequences which flank the expression construct.
The integrating vector may be directed to a specific locus in the host cell by selecting the
appropriate homologous sequence for inclusion in the vector. Constructs for integrating
vectors are well known in the art (e.g., Fernandez & Hoeffler, supra).

In addition, in a preferred embodiment, the expression vector contains a
selectable marker gene to allow the selection of transformed host cells. Selection genes are
well known in the art and will vary with the host cell used.

The prostate cancer proteins of the present invention are produced by culturing
a host cell transformed with an expression vector containing nucleic acid encoding a prostate
cancer protein, under the appropriate conditions to induce or cause expression of the prostate
cancer protein. Conditions appropriate for prostate cancer protein expression will vary with
the choice of the expression vector and the host cell, and will be easily ascertained by one
skilled in the art through routine experimentation or optimization. For example, the use of
constitutive promoters in the expression vector will require optimizing the growth and
proliferation of the host cell, while the use of an inducible promoter requires the appropriate
growth conditions for induction. In addition, in some embodiments, the timing of the harvest
is important. For example, the baculoviral systems used in insect cell expression are lytic
viruses, and thus harvest time selection can be crucial for product yield.

Appropriate host cells include yeast, bacteria, archaebacteria, fungi, and insect
and animal cells, including mammalian cells. Of particular interest are Saccharomyces
cerevisiae and other yeasts, E. coli, Bacillus subtilis, Sf9 cells, C129 cells, 293 cells,
Neurospora, BHK, CHO, COS, HeLa cells, HUVEC (human umbilical vein endothelial
cells), THP1 cells (a macrophage cell line) and various other human cells and cell lines.

In a preferred embodiment, the prostate cancer proteins are expressed in
mammalian cells. Mammalian expression systems are also known in the art, and include
retroviral and adenoviral systems. One expression vector system is a retroviral vector system
such as is generally described in PCT/US97/01019 and PCT/US97/01048, both of which are
hereby expressly incorporated by reference. Of particular use as mammalian promoters are
the promoters from mammalian viral genes, since the viral genes are often highly expressed
and have a broad host range. Examples include the SV40 early promoter, mouse mammary
tumor virus LTR promoter, adenovirus major late promoter, herpes simplex virus promoter,
and the CMV promoter (see, e.g., Fernandez & Hoeffler, supra). Typically, transcription
termination and polyadenylation sequences recognized by mammalian cells are regulatory
regions located 3' to the translation stop codon and thus, together with the promoter elements,
flank the coding sequence. Examples of transcription terminator and polyadenylation signals
include those derived from SV40.

The methods of introducing exogenous nucleic acid into mammalian hosts, as
well as other hosts, are well known in the art, and will vary with the host cell used.
Techniques include dextran-mediated transfection, calcium phosphate precipitation,
polybrene mediated transfection, protoplast fusion, electroporation, viral infection,
encapsulation of the polynucleotide(s) in liposomes, and direct microinjection of the DNA
into nuclei.

In a preferred embodiment, prostate cancer proteins are expressed in bacterial
systems. Bacterial expression systems are well known in the art. Promoters from
bacteriophage may also be used and are known in the art. In addition, synthetic promoters
and hybrid promoters are also useful; e.g., the tac promoter is a hybrid of the trp and lac
promoter sequences. Furthermore, a bacterial promoter can include naturally occurring
promoters of non-bacterial origin that have the ability to bind bacterial RNA polymerase and
initiate transcription. In addition to a functioning promoter sequence, an efficient ribosome
binding site is desirable. The expression vector may also include a signal peptide sequence
that provides for secretion of the prostate cancer protein in bacteria. The protein is either
secreted into the growth media (gram-positive bacteria) or into the periplasmic space, located
between the inner and outer membrane of the cell (gram-negative bacteria). The bacterial
expression vector may also include a selectable marker gene to allow for the selection of
bacterial strains that have been transformed. Suitable selection genes include genes which
render the bacteria resistant to drugs such as ampicillin, chloramphenicol, erythromycin,
kanamycin, neomycin and tetracycline. Selectable markers also include biosynthetic genes,
such as those in the histidine, tryptophan and leucine biosynthetic pathways. These
components are assembled into expression vectors. Expression vectors for bacteria are well
known in the art, and include vectors for Bacillus subtilis, E. coli, Streptococcus cremoris,
and Streptococcus lividans, among others (e.g., Fernandez & Hoeffler, supra). The bacterial
expression vectors are transformed into bacterial host cells using techniques well known in
the art, such as calcium chloride treatment, electroporation, and others.

In one embodiment, prostate cancer proteins are produced in insect cells.
Expression vectors for the transformation of insect cells, and in particular, baculovirus-based
expression vectors, are well known in the art.

In a preferred embodiment, prostate cancer protein is produced in yeast cells.
Yeast expression systems are well known in the art, and include expression vectors for
Saccharomyces cerevisiae, Candida albicans and C. maltosa, Hansenula polymorpha,
Kluyveromyces fragilis and K. lactis, Pichia guilliermondii and P. pastoris,
Schizosaccharomyces pombe, and Yarrowia lipolytica.

The prostate cancer protein may also be made as a fusion protein, using
techniques well known in the art. Thus, e.g., for the creation of monoclonal antibodies, if the
desired epitope is small, the prostate cancer protein may be fused to a carrier protein to form
an immunogen. Alternatively, the prostate cancer protein may be made as a fusion protein to
increase expression, or for other reasons. For example, when the prostate cancer protein is a
prostate cancer peptide, the nucleic acid encoding the peptide may be linked to other nucleic
acid for expression purposes.

In a preferred embodiment, the prostate cancer protein is purified or isolated
after expression. Prostate cancer proteins may be isolated or purified in a variety of ways
known to those skilled in the art depending on what other components are present in the
sample. Standard purification methods include electrophoretic, molecular, immunological
and chromatographic techniques, including ion exchange, hydrophobic, affinity, and
reverse-phase HPLC chromatography, and chromatofocusing. For example, the prostate cancer
protein may be purified using a standard anti-prostate cancer protein antibody column.
Ultrafiltration and diafiltration techniques, in conjunction with protein concentration, are also
useful. For general guidance in suitable purification techniques, see Scopes, Protein
Purification (1982). The degree of purification necessary will vary depending on the use of
the prostate cancer protein. In some instances no purification will be necessary.

Once expressed and purified if necessary, the prostate cancer proteins and 
nucleic acids are useful in a number of applications. They may be used as immunoselection 
reagents, as vaccine reagents, as screening agents, etc. 

WO 02/30268 



PCT/US01/32045 



Variants of prostate cancer proteins 

In one embodiment, the prostate cancer proteins are derivative or variant
prostate cancer proteins as compared to the wild-type sequence. That is, as outlined more
fully below, the derivative prostate cancer peptide will often contain at least one amino acid
substitution, deletion or insertion, with amino acid substitutions being particularly preferred.
The amino acid substitution, insertion or deletion may occur at any residue within the
prostate cancer peptide.

Also included within one embodiment of prostate cancer proteins of the
present invention are amino acid sequence variants. These variants typically fall into one or
more of three classes: substitutional, insertional or deletional variants. These variants
ordinarily are prepared by site specific mutagenesis of nucleotides in the DNA encoding the
prostate cancer protein, using cassette or PCR mutagenesis or other techniques well known in
the art, to produce DNA encoding the variant, and thereafter expressing the DNA in
recombinant cell culture as outlined above. However, variant prostate cancer protein
fragments having up to about 100-150 residues may be prepared by in vitro synthesis using
established techniques. Amino acid sequence variants are characterized by the predetermined
nature of the variation, a feature that sets them apart from naturally occurring allelic or
interspecies variation of the prostate cancer protein amino acid sequence. The variants
typically exhibit the same qualitative biological activity as the naturally occurring analogue,
although variants can also be selected which have modified characteristics as will be more
fully outlined below.

While the site or region for introducing an amino acid sequence variation is
predetermined, the mutation per se need not be predetermined. For example, in order to
optimize the performance of a mutation at a given site, random mutagenesis may be
conducted at the target codon or region and the expressed prostate cancer variants screened
for the optimal combination of desired activity. Techniques for making substitution
mutations at predetermined sites in DNA having a known sequence are well known, e.g.,
M13 primer mutagenesis and PCR mutagenesis. Screening of the mutants is done using
assays of prostate cancer protein activities.
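
The idea of fixing the site while leaving the mutation open can be illustrated in silico by enumerating every alternative codon at one position of a coding sequence. This is only a sketch of the resulting variant library, not of any laboratory mutagenesis protocol; the function name and the 0-based codon indexing are illustrative assumptions.

```python
from itertools import product


def codon_variants(coding_seq, codon_index):
    """Return the 63 sequences that differ from `coding_seq` only at the
    codon with 0-based index `codon_index`."""
    coding_seq = coding_seq.upper()
    start = 3 * codon_index
    original = coding_seq[start:start + 3]
    if len(original) != 3:
        raise ValueError("codon_index is outside the coding sequence")
    variants = []
    for bases in product("ACGT", repeat=3):
        codon = "".join(bases)
        if codon != original:
            # Splice the alternative codon into the otherwise unchanged sequence.
            variants.append(coding_seq[:start] + codon + coding_seq[start + 3:])
    return variants
```

The returned list includes stop codons and synonymous codons; a screening step, as described above, would determine which variants retain or improve the desired activity.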

Amino acid substitutions are typically of single residues; insertions usually
will be on the order of from about 1 to 20 amino acids, although considerably larger
insertions may be tolerated. Deletions range from about 1 to about 20 residues, although in
some cases deletions may be much larger.

Substitutions, deletions, insertions or any combination thereof may be used to
arrive at a final derivative. Generally these changes are done on a few amino acids to
minimize the alteration of the molecule. However, larger changes may be tolerated in certain
circumstances. When small alterations in the characteristics of the prostate cancer protein are
desired, substitutions are generally made in accordance with the amino acid substitution
relationships provided in the definition section.

The variants typically exhibit the same qualitative biological activity and will
elicit the same immune response as the naturally-occurring analog, although variants also are
selected to modify the characteristics of the prostate cancer proteins as needed. Alternatively,
the variant may be designed such that the biological activity of the prostate cancer protein is
altered. For example, glycosylation sites may be altered or removed.

Substantial changes in function or immunological identity are made by 

15 selecting substitutions that are less conservative than those described above. For example, 
substitutions may be made which more significantly affect: the structure of the polypeptide 
backbone in the area of the alteration, for example the alpha-helical or beta-sheet structure; 
the charge or hydrophobicity of the molecule at the target site; or the bulk of the side chain. 
The substitutions which in general are expected to produce the greatest changes in the 

20 polypeptide's properties are those in which (a) a hydrophilic residue, e.g. seryl or threonyl is 
substituted for (or by) a hydrophobic residue, e.g. leucyl, isoleucyl, phenylalanyl, valyl or 
alanyl; (b) a cysteine or proline is substituted for (or by) any other residue; (c) a residue 
having an electropositive side chain, e.g. lysyl, arginyl, or histidyl, is substituted for (or by) 
an electronegative residue, e.g. glutamyl or aspartyl; or (d) a residue having a bulky side
chain, e.g. phenylalanine, is substituted for (or by) one not having a side chain, e.g. glycine.

Covalent modifications of prostate cancer polypeptides are included within the 
scope of this invention. One type of covalent modification includes reacting targeted amino 
acid residues of a prostate cancer polypeptide with an organic derivatizing agent that is 
capable of reacting with selected side chains or the N- or C-terminal residues of a prostate
cancer polypeptide. Derivatization with bifunctional agents is useful, for instance, for
crosslinking prostate cancer polypeptides to a water-insoluble support matrix or surface for
use in the method for purifying anti-prostate cancer polypeptide antibodies or screening
assays, as is more fully described below. Commonly used crosslinking agents include, e.g.,
1,1-bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N-hydroxysuccinimide esters, e.g.,
esters with 4-azidosalicylic acid, homobifunctional imidoesters, including disuccinimidyl
esters such as 3,3'-dithiobis(succinimidylpropionate), bifunctional maleimides such as
bis-N-maleimido-1,8-octane and agents such as methyl-3-((p-azidophenyl)dithio)propioimidate.

Other modifications include deamidation of glutaminyl and asparaginyl 
residues to the corresponding glutamyl and aspartyl residues, respectively, hydroxylation of 
proline and lysine, phosphorylation of hydroxyl groups of seryl, threonyl or tyrosyl residues,
methylation of the amino groups of the lysine, arginine, and histidine side chains (Creighton,
Proteins: Structure and Molecular Properties, pp. 79-86 (1983)), acetylation of the N- 
terminal amine, and amidation of any C-terminal carboxyl group. 

Another type of covalent modification of the prostate cancer polypeptide 
included within the scope of this invention comprises altering the native glycosylation pattern
of the polypeptide. "Altering the native glycosylation pattern" is intended for purposes
herein to mean deleting one or more carbohydrate moieties found in native sequence prostate
cancer polypeptide, and/or adding one or more glycosylation sites that are not present in the 
native sequence prostate cancer polypeptide. Glycosylation patterns can be altered in many 
ways. For example, the use of different cell types to express prostate cancer-associated
sequences can result in different glycosylation patterns.

Addition of glycosylation sites to prostate cancer polypeptides may also be 
accomplished by altering the amino acid sequence thereof. The alteration may be made, e.g., 
by the addition of, or substitution by, one or more serine or threonine residues to the native 
sequence prostate cancer polypeptide (for O-linked glycosylation sites). The prostate cancer
amino acid sequence may optionally be altered through changes at the DNA level,
particularly by mutating the DNA encoding the prostate cancer polypeptide at preselected
bases such that codons are generated that will translate into the desired amino acids. 

Another means of increasing the number of carbohydrate moieties on the 
prostate cancer polypeptide is by chemical or enzymatic coupling of glycosides to the
polypeptide. Such methods are described in the art, e.g., in WO 87/05330, and in Aplin &
Wriston, CRC Crit. Rev. Biochem., pp. 259-306 (1981).

Removal of carbohydrate moieties present on the prostate cancer polypeptide
may be accomplished chemically or enzymatically or by mutational substitution of codons
encoding amino acid residues that serve as targets for glycosylation. Chemical
deglycosylation techniques are known in the art and described, for instance, by Hakimuddin
et al., Arch. Biochem. Biophys. 259:52 (1987) and by Edge et al., Anal. Biochem. 118:131
(1981). Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the
use of a variety of endo- and exo-glycosidases as described by Thotakura et al., Meth.
Enzymol. 138:350 (1987).

Another type of covalent modification of prostate cancer polypeptides comprises linking the
prostate cancer polypeptide to one of a variety of nonproteinaceous polymers, e.g.,
polyethylene glycol, polypropylene glycol, or polyoxyalkylenes, in the manner set forth in
U.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or 4,179,337. 

Prostate cancer polypeptides of the present invention may also be modified in 
a way to form chimeric molecules comprising a prostate cancer polypeptide fused to another,
heterologous polypeptide or amino acid sequence. In one embodiment, such a chimeric
molecule comprises a fusion of a prostate cancer polypeptide with a tag polypeptide which 
provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is 
generally placed at the amino- or carboxyl-terminus of the prostate cancer polypeptide. The
presence of such epitope-tagged forms of a prostate cancer polypeptide can be detected using
an antibody against the tag polypeptide. Also, provision of the epitope tag enables the
prostate cancer polypeptide to be readily purified by affinity purification using an anti-tag 
antibody or another type of affinity matrix that binds to the epitope tag. In an alternative 
embodiment, the chimeric molecule may comprise a fusion of a prostate cancer polypeptide 
with an immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of
the chimeric molecule, such a fusion could be to the Fc region of an IgG molecule.

Various tag polypeptides and their respective antibodies are well known in the 
art. Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; 
HIS6 and metal chelation tags; the flu HA tag polypeptide and its antibody 12CA5 (Field et
al., Mol. Cell. Biol. 8:2159-2165 (1988)); the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and
9E10 antibodies thereto (Evan et al., Molecular and Cellular Biology 5:3610-3616 (1985));
and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody (Paborsky et al.,
Protein Engineering 3(6):547-553 (1990)). Other tag polypeptides include the Flag-peptide
(Hopp et al., BioTechnology 6:1204-1210 (1988)); the KT3 epitope peptide (Martin et al.,
Science 255:192-194 (1992)); tubulin epitope peptide (Skinner et al., J. Biol. Chem.
266:15163-15166 (1991)); and the T7 gene 10 protein peptide tag (Lutz-Freyermuth et al.,
Proc. Natl. Acad. Sci. USA 87:6393-6397 (1990)).

Also included are other prostate cancer proteins of the prostate cancer family, 
and prostate cancer proteins from other organisms, which are cloned and expressed as 
outlined below. Thus, probe or degenerate polymerase chain reaction (PCR) primer 
sequences may be used to find other related prostate cancer proteins from humans or other
organisms. As will be appreciated by those in the art, particularly useful probe and/or PCR
primer sequences include the unique areas of the prostate cancer nucleic acid sequence. As is 
generally known in the art, preferred PCR primers are from about 15 to about 35 nucleotides 
in length, with from about 20 to about 30 being preferred, and may contain inosine as needed. 
The conditions for the PCR reaction are well known in the art (e.g., Innis, PCR Protocols,
supra).
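
The primer-length guidance above can be expressed as a simple check. The following is a minimal sketch only: the function name, the tier labels, and the use of the letter "I" to denote inosine are illustrative assumptions, and the "about" in the stated ranges is treated here as a hard cutoff purely for simplicity.

```python
# Hypothetical helper; names and the inosine symbol "I" are illustrative assumptions.
ALLOWED_BASES = set("ACGTI")  # "I" marks an inosine at a degenerate position


def classify_primer(seq: str) -> str:
    """Classify a primer by length using the ranges stated in the text.

    About 20 to 30 nucleotides is treated as the preferred range and
    about 15 to 35 nucleotides as the broader acceptable range.
    """
    primer = seq.upper()
    unknown = set(primer) - ALLOWED_BASES
    if unknown:
        raise ValueError(f"unexpected symbols in primer: {sorted(unknown)}")
    length = len(primer)
    if 20 <= length <= 30:
        return "preferred"
    if 15 <= length <= 35:
        return "acceptable"
    return "out of range"


if __name__ == "__main__":
    # A made-up 20-mer containing one inosine, for demonstration only.
    print(classify_primer("ACGTACGTIACGTACGTACG"))
```

Such a check addresses length and alphabet only; it says nothing about uniqueness of the target region, which the passage identifies as the more important selection criterion.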

Antibodies to prostate cancer proteins 

In a preferred embodiment, when the prostate cancer protein is to be used to 
generate antibodies, e.g., for immunotherapy or immunodiagnosis, the prostate cancer protein
should share at least one epitope or determinant with the full length protein. By "epitope" or
"determinant" herein is typically meant a portion of a protein which will generate and/or bind 
an antibody or T-cell receptor in the context of MHC. Thus, in most instances, antibodies 
made to a smaller prostate cancer protein will be able to bind to the full-length protein, 
particularly linear epitopes. In a preferred embodiment, the epitope is unique; that is,
antibodies generated to a unique epitope show little or no cross-reactivity.

Methods of preparing polyclonal antibodies are known to the skilled artisan 
(e.g., Coligan, supra; and Harlow & Lane, supra). Polyclonal antibodies can be raised in a 
mammal, e.g., by one or more injections of an immunizing agent and, if desired, an adjuvant. 
Typically, the immunizing agent and/or adjuvant will be injected in the mammal by multiple
subcutaneous or intraperitoneal injections. The immunizing agent may include a protein
encoded by a nucleic acid of the figures or fragment thereof or a fusion protein thereof. It
may be useful to conjugate the immunizing agent to a protein known to be immunogenic in
the mammal being immunized. Examples of such immunogenic proteins include but are not
limited to keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, and soybean
trypsin inhibitor. Examples of adjuvants which may be employed include Freund's complete
adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose
dicorynomycolate). The immunization protocol may be selected by one skilled in the art
without undue experimentation. 

The antibodies may, alternatively, be monoclonal antibodies. Monoclonal 
antibodies may be prepared using hybridoma methods, such as those described by Kohler &
Milstein, Nature 256:495 (1975). In a hybridoma method, a mouse, hamster, or other
appropriate host animal, is typically immunized with an immunizing agent to elicit 
lymphocytes that produce or are capable of producing antibodies that will specifically bind to 
the immunizing agent. Alternatively, the lymphocytes may be immunized in vitro. The 
immunizing agent will typically include a polypeptide encoded by a nucleic acid of Tables 1-16,
a fragment thereof, or a fusion protein thereof. Generally, either peripheral blood
lymphocytes ("PBLs") are used if cells of human origin are desired, or spleen cells or lymph
node cells are used if non-human mammalian sources are desired. The lymphocytes are then 
fused with an immortalized cell line using a suitable fusing agent, such as polyethylene 
glycol, to form a hybridoma cell (Goding, Monoclonal Antibodies: Principles and Practice,
pp. 59-103 (1986)). Immortalized cell lines are usually transformed mammalian cells,
particularly myeloma cells of rodent, bovine and human origin. Usually, rat or mouse 
myeloma cell lines are employed. The hybridoma cells may be cultured in a suitable culture 
medium that preferably contains one or more substances that inhibit the growth or survival of 
the unfused, immortalized cells. For example, if the parental cells lack the enzyme
hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium
for the hybridomas typically will include hypoxanthine, aminopterin, and thymidine ("HAT 
medium"), which substances prevent the growth of HGPRT-deficient cells. 

In one embodiment, the antibodies are bispecific antibodies. Bispecific 
antibodies are monoclonal, preferably human or humanized, antibodies that have binding
specificities for at least two different antigens or that have binding specificities for two
epitopes on the same antigen. In one embodiment, one of the binding specificities is for a
protein encoded by a nucleic acid of Tables 1-16 or a fragment thereof, the other one is for any
other antigen, and preferably for a cell-surface protein or receptor or receptor subunit, 
preferably one that is tumor specific. Alternatively, tetramer-type technology may create 
multivalent reagents. 

In a preferred embodiment, the antibodies to prostate cancer protein are
capable of reducing or eliminating a biological function of a prostate cancer protein, as is
described below. That is, the addition of anti-prostate cancer protein antibodies (either 
polyclonal or preferably monoclonal) to prostate cancer tissue (or cells containing prostate 
cancer) may reduce or eliminate the prostate cancer. Generally, at least a 25% decrease in 
activity, growth, size or the like is preferred, with at least about 50% being particularly
preferred and about a 95-100% decrease being especially preferred. 
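
The preference tiers stated above can be illustrated with a short calculation. This is a sketch under stated assumptions: the function names are hypothetical, the measured quantity (activity, growth, size or the like) is assumed to be a positive number, and the word "about" in the thresholds is treated as an exact cutoff.

```python
def percent_decrease(baseline: float, treated: float) -> float:
    """Percent decrease of a measured quantity relative to its untreated baseline."""
    if baseline <= 0:
        raise ValueError("baseline must be positive")
    return 100.0 * (baseline - treated) / baseline


def preference_tier(decrease_pct: float) -> str:
    """Map a percent decrease onto the tiers named in the text (cutoffs assumed exact)."""
    if decrease_pct >= 95.0:
        return "especially preferred"
    if decrease_pct >= 50.0:
        return "particularly preferred"
    if decrease_pct >= 25.0:
        return "preferred"
    return "below preferred threshold"


if __name__ == "__main__":
    # Made-up measurements: baseline of 200 units reduced to 100 units.
    drop = percent_decrease(200.0, 100.0)
    print(drop, preference_tier(drop))
```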

In a preferred embodiment the antibodies to the prostate cancer proteins are 
humanized antibodies (e.g., Xenerex Biosciences, Medarex, Inc., Abgenix, Inc., Protein
Design Labs, Inc.). Humanized forms of non-human (e.g., murine) antibodies are chimeric
molecules of immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv,
Fab, Fab', F(ab')2 or other antigen-binding subsequences of antibodies) which contain 
minimal sequence derived from non-human immunoglobulin. Humanized antibodies include 
human immunoglobulins (recipient antibody) in which residues from a complementarity
determining region (CDR) of the recipient are replaced by residues from a CDR of a
non-human species (donor antibody) such as mouse, rat or rabbit having the desired specificity,
affinity and capacity. In some instances, Fv framework residues of the human 
immunoglobulin are replaced by corresponding non-human residues. Humanized antibodies 
may also comprise residues which are found neither in the recipient antibody nor in the 
imported CDR or framework sequences. In general, a humanized antibody will comprise 
substantially all of at least one, and typically two, variable domains, in which all or
substantially all of the CDR regions correspond to those of a non-human immunoglobulin
and all or substantially all of the framework (FR) regions are those of a human 
immunoglobulin consensus sequence. The humanized antibody optimally also will comprise 
at least a portion of an immunoglobulin constant region (Fc), typically that of a human 
immunoglobulin (Jones et al., Nature 321:522-525 (1986); Riechmann et al., Nature
332:323-327 (1988); and Presta, Curr. Op. Struct. Biol. 2:593-596 (1992)). Humanization
can be essentially performed following the method of Winter and co-workers (Jones et al.,
Nature 321:522-525 (1986); Riechmann et al., Nature 332:323-327 (1988); Verhoeyen et al.,
Science 239:1534-1536 (1988)), by substituting rodent CDRs or CDR sequences for the
corresponding sequences of a human antibody. Accordingly, such humanized antibodies are
chimeric antibodies (U.S. Patent No. 4,816,567), wherein substantially less than an intact
human variable domain has been substituted by the corresponding sequence from a non- 
human species. 

Human antibodies can also be produced using various techniques known in the 
art, including phage display libraries (Hoogenboom & Winter, J. Mol. Biol. 227:381 (1991);
Marks et al., J. Mol. Biol. 222:581 (1991)). The techniques of Cole et al. and Boerner et al.
are also available for the preparation of human monoclonal antibodies (Cole et al.,
Monoclonal Antibodies and Cancer Therapy, p. 77 (1985) and Boerner et al., J. Immunol.
147(1):86-95 (1991)). Similarly, human antibodies can be made by introducing human
immunoglobulin loci into transgenic animals, e.g., mice in which the endogenous
immunoglobulin genes have been partially or completely inactivated. Upon challenge,
human antibody production is observed, which closely resembles that seen in humans in all
respects, including gene rearrangement, assembly, and antibody repertoire. This approach is
described, e.g., in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 5,633,425;
5,661,016, and in the following scientific publications: Marks et al., Bio/Technology
10:779-783 (1992); Lonberg et al., Nature 368:856-859 (1994); Morrison, Nature 368:812-13
(1994); Fishwild et al., Nature Biotechnology 14:845-51 (1996); Neuberger, Nature
Biotechnology 14:826 (1996); Lonberg & Huszar, Intern. Rev. Immunol. 13:65-93 (1995).

By immunotherapy is meant treatment of prostate cancer with an antibody 
raised against prostate cancer proteins. As used herein, immunotherapy can be passive or
active. Passive immunotherapy as defined herein is the passive transfer of antibody to a
recipient (patient). Active immunization is the induction of antibody and/or T-cell responses
in a recipient (patient). Induction of an immune response is the result of providing the 
recipient with an antigen to which antibodies are raised. As appreciated by one of ordinary 
skill in the art, the antigen may be provided by injecting a polypeptide against which
antibodies are desired to be raised into a recipient, or contacting the recipient with a nucleic
acid capable of expressing the antigen and under conditions for expression of the antigen,
leading to an immune response. 

In a preferred embodiment the prostate cancer proteins against which 
antibodies are raised are secreted proteins as described above. Without being bound by
theory, antibodies used for treatment bind and prevent the secreted protein from binding to
its receptor, thereby inactivating the secreted prostate cancer protein. 

In another preferred embodiment, the prostate cancer protein to which 
antibodies are raised is a transmembrane protein. Without being bound by theory, antibodies
used for treatment bind the extracellular domain of the prostate cancer protein and prevent it
from binding to other proteins, such as circulating ligands or cell-associated molecules. The
antibody may cause down-regulation of the transmembrane prostate cancer protein. As will 
be appreciated by one of ordinary skill in the art, the antibody may be a competitive, non- 
competitive or uncompetitive inhibitor of protein binding to the extracellular domain of the 
prostate cancer protein. The antibody is also an antagonist of the prostate cancer protein.
Further, the antibody prevents activation of the transmembrane prostate cancer protein. In
one aspect, when the antibody prevents the binding of other molecules to the prostate cancer 
protein, the antibody prevents growth of the cell. The antibody may also be used to target or 
sensitize the cell to cytotoxic agents, including, but not limited to TNF-α, TNF-β, IL-1, IFN-γ
and IL-2, or chemotherapeutic agents including 5FU, vinblastine, actinomycin D, cisplatin,
methotrexate, and the like. In some instances the antibody belongs to a sub-type that
activates serum complement when complexed with the transmembrane protein thereby
mediating cytotoxicity or antibody-dependent cellular cytotoxicity (ADCC). Thus, prostate cancer is
treated by administering to a patient antibodies directed against the transmembrane prostate
cancer protein. Antibody-labeling may activate a co-toxin, localize a toxin payload, or
otherwise provide means to locally ablate cells.

In another preferred embodiment, the antibody is conjugated to an effector 
moiety. The effector moiety can be any number of molecules, including labelling moieties 
such as radioactive labels or fluorescent labels, or can be a therapeutic moiety. In one aspect 
the therapeutic moiety is a small molecule that modulates the activity of the prostate cancer
protein. In another aspect the therapeutic moiety modulates the activity of molecules
associated with or in close proximity to the prostate cancer protein. The therapeutic moiety
may inhibit enzymatic activity such as protease or collagenase or protein kinase activity
associated with prostate cancer. 

In a preferred embodiment, the therapeutic moiety can also be a cytotoxic 
agent. In this method, targeting the cytotoxic agent to prostate cancer tissue or cells results
in a reduction in the number of afflicted cells, thereby reducing symptoms associated with
prostate cancer. Cytotoxic agents are numerous and varied and include, but are not limited 
to, cytotoxic drugs or toxins or active fragments of such toxins. Suitable toxins and their 
corresponding fragments include diphtheria A chain, exotoxin A chain, ricin A chain, abrin A 
chain, curcin, crotin, phenomycin, enomycin and the like. Cytotoxic agents also include
radiochemicals made by conjugating radioisotopes to antibodies raised against prostate
cancer proteins, or binding of a radionuclide to a chelating agent that has been covalently 
attached to the antibody. Targeting the therapeutic moiety to transmembrane prostate cancer 
proteins not only serves to increase the local concentration of therapeutic moiety in the 
prostate cancer afflicted area, but also serves to reduce deleterious side effects that may be
associated with the therapeutic moiety.

In another preferred embodiment, the prostate cancer protein against which the 
antibodies are raised is an intracellular protein. In this case, the antibody may be conjugated 
to a protein which facilitates entry into the cell. In one case, the antibody enters the cell by 
endocytosis. In another embodiment, a nucleic acid encoding the antibody is administered to
the individual or cell. Moreover, where the prostate cancer protein can be targeted within a
cell, e.g., the nucleus, an antibody thereto contains a signal for that target localization, e.g., a
nuclear localization signal.

The prostate cancer antibodies of the invention specifically bind to prostate 
cancer proteins. By "specifically bind" herein is meant that the antibodies bind to the protein
with a Kd of at least about 0.1 mM, more usually at least about 1 µM, preferably at least about
0.1 µM or better, and most preferably, 0.01 µM or better. Selectivity of binding is also
important. 
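
The affinity tiers above can be sketched as a lookup on a measured dissociation constant. This is an illustration only: the function name and tier labels are hypothetical, and reading "at least ... or better" as "a Kd at or below the stated value" (lower Kd meaning tighter binding) is an assumption about the intended meaning.

```python
# Upper Kd limits in molar units, taken from the values in the passage above.
# Treating each value as an upper bound on Kd is an assumption (lower Kd = tighter binding).
KD_TIERS = [
    (1e-8, "most preferred"),  # 0.01 uM or better
    (1e-7, "preferred"),       # 0.1 uM or better
    (1e-6, "more usual"),      # about 1 uM
    (1e-4, "specific"),        # about 0.1 mM
]


def binding_tier(kd_molar: float) -> str:
    """Return the tightest tier whose Kd limit the measured value satisfies."""
    if kd_molar <= 0:
        raise ValueError("Kd must be positive")
    for limit, label in KD_TIERS:
        if kd_molar <= limit:
            return label
    return "not specific by this definition"


if __name__ == "__main__":
    # A made-up measurement of 5 nM.
    print(binding_tier(5e-9))
```

Note that this addresses affinity only; selectivity of binding, which the passage also calls important, would require comparing Kd values across different proteins.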

Detection of prostate cancer sequence for diagnostic and therapeutic applications 

In one aspect, the RNA expression levels of genes are determined for different
cellular states in the prostate cancer phenotype. Expression levels of genes in normal tissue
(i.e., not undergoing prostate cancer) and in prostate cancer tissue (and in some cases, for
varying severities of prostate cancer that relate to prognosis, as outlined below) are evaluated 
to provide expression profiles. An expression profile of a particular cell state or point of 
development is essentially a "fingerprint" of the state. While two states may have any 
particular gene similarly expressed, the evaluation of a number of genes simultaneously
allows the generation of a gene expression profile that is reflective of the state of the cell. By
comparing expression profiles of cells in different states, information regarding which genes 
are important (including both up- and down-regulation of genes) in each of these states is 
obtained. Then, diagnosis may be performed or confirmed to determine whether a tissue
sample has the gene expression profile of normal or cancerous tissue. This will provide for
molecular diagnosis of related conditions. 
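
As a rough illustration of comparing a sample's expression profile against reference profiles for different states, the sketch below assigns a sample to whichever reference it correlates with most strongly. The use of Pearson correlation, the gene names, and all expression values are assumptions made for demonstration; the text does not prescribe a particular comparison metric.

```python
from math import sqrt


def pearson(x: list[float], y: list[float]) -> float:
    """Pearson correlation coefficient of two equal-length value lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    if sd_x == 0 or sd_y == 0:
        raise ValueError("a profile with no variation cannot be correlated")
    return cov / (sd_x * sd_y)


def closest_state(sample: dict[str, float], references: dict[str, dict[str, float]]) -> str:
    """Return the name of the reference profile most correlated with the sample."""
    genes = sorted(sample)  # fixed gene order shared by every comparison

    def score(state: str) -> float:
        profile = references[state]
        return pearson([sample[g] for g in genes], [profile[g] for g in genes])

    return max(references, key=score)


# Made-up reference profiles and sample (arbitrary expression units).
REFERENCES = {
    "normal": {"geneA": 1.0, "geneB": 8.0, "geneC": 5.0, "geneD": 2.0},
    "prostate cancer": {"geneA": 9.0, "geneB": 2.0, "geneC": 5.0, "geneD": 7.0},
}
SAMPLE = {"geneA": 8.0, "geneB": 3.0, "geneC": 4.5, "geneD": 6.0}

if __name__ == "__main__":
    print(closest_state(SAMPLE, REFERENCES))
```

With only four hypothetical genes this is a toy; the point of the passage is that evaluating many genes together makes the profile far more informative than any single gene.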

"Differential expression," or grammatical equivalents as used herein, refers to 
qualitative or quantitative differences in the temporal and/or cellular gene expression 
patterns within and among cells and tissue. Thus, a differentially expressed gene can
qualitatively have its expression altered, including an activation or inactivation, in, e.g.,
normal versus prostate cancer tissue. Genes may be turned on or turned off in a particular 
state, relative to another state thus permitting comparison of two or more states. A 
qualitatively regulated gene will exhibit an expression pattern within a state or cell type 
which is detectable by standard techniques. Some genes will be expressed in one state or cell
type, but not in both. Alternatively, the difference in expression may be quantitative, e.g., in
that expression is increased or decreased; i.e., gene expression is either upregulated, resulting 
in an increased amount of transcript, or downregulated, resulting in a decreased amount of 
transcript. The degree to which expression differs need only be large enough to quantify via 
standard characterization techniques as outlined below, such as by use of Affymetrix
GeneChip™ expression arrays, Lockhart, Nature Biotechnology 14:1675-1680 (1996),
hereby expressly incorporated by reference. Other techniques include, but are not limited to,
quantitative reverse transcriptase PCR, northern analysis and RNase protection. As outlined 
above, preferably the change in expression (i.e., upregulation or downregulation) is at least 
about 50%, more preferably at least about 100%, more preferably at least about 150%, more
preferably at least about 200%, with from 300 to at least 1000% being especially preferred.

Evaluation may be at the gene transcript or the protein level. The amount of
gene expression may be monitored using nucleic acid probes to the DNA or RNA equivalent 
of the gene transcript, and the quantification of gene expression levels, or, alternatively, the 
final gene product itself (protein) can be monitored, e.g., with antibodies to the prostate 
cancer protein and standard immunoassays (ELISAs, etc.) or other techniques, including
mass spectroscopy assays, 2D gel electrophoresis assays, etc. Proteins corresponding to 
prostate cancer genes, i.e., those identified as being important in a prostate cancer phenotype, 
can be evaluated in a prostate cancer diagnostic test. 

In a preferred embodiment, gene expression monitoring is performed
simultaneously on a number of genes. Multiple protein expression monitoring can be
performed as well. Similarly, these assays may be performed on an individual basis as well.

In this embodiment, the prostate cancer nucleic acid probes are attached to 
biochips as outlined herein for the detection and quantification of prostate cancer sequences 
in a particular cell. The assays are further described below in the example. PCR techniques
can be used to provide greater sensitivity.

In a preferred embodiment nucleic acids encoding the prostate cancer protein 
are detected. Although DNA or RNA encoding the prostate cancer protein may be detected, 
of particular interest are methods wherein an mRNA encoding a prostate cancer protein is 
detected. Probes to detect mRNA can be a nucleotide/deoxynucleotide probe that is
complementary to and hybridizes with the mRNA and includes, but is not limited to,
oligonucleotides, cDNA or RNA. Probes also should contain a detectable label, as defined
herein. In one method the mRNA is detected after immobilizing the nucleic acid to be 
examined on a solid support such as nylon membranes and hybridizing the probe with the 
sample. Following washing to remove the non-specifically bound probe, the label is
detected. In another method detection of the mRNA is performed in situ. In this method
permeabilized cells or tissue samples are contacted with a detectably labeled nucleic acid 
probe for sufficient time to allow the probe to hybridize with the target mRNA. Following 
washing to remove the non-specifically bound probe, the label is detected. For example, a
digoxigenin-labeled riboprobe (RNA probe) that is complementary to the mRNA encoding a
prostate cancer protein is detected by binding the digoxigenin with an anti-digoxigenin
secondary antibody and developed with nitro blue tetrazolium and 5-bromo-4-chloro-3-
indolyl phosphate.

In a preferred embodiment, various proteins from the three classes of proteins 
as described herein (secreted, transmembrane or intracellular proteins) are used in diagnostic 
assays. The prostate cancer proteins, antibodies, nucleic acids, modified proteins and cells
containing prostate cancer sequences are used in diagnostic assays. This can be performed on 
an individual gene or corresponding polypeptide level. In a preferred embodiment, the 
expression profiles are used, preferably in conjunction with high throughput screening 
techniques to allow monitoring for expression profile genes and/or corresponding 
polypeptides.

As described and defined herein, prostate cancer proteins, including 
intracellular, transmembrane or secreted proteins, find use as markers of prostate cancer. 
Detection of these proteins in putative prostate cancer tissue allows for detection or diagnosis 
of prostate cancer. In one embodiment, antibodies are used to detect prostate cancer proteins.
A preferred method separates proteins from a sample by electrophoresis on a gel (typically a
denaturing and reducing protein gel, but may be another type of gel, including isoelectric 
focusing gels and the like). Following separation of proteins, the prostate cancer protein is 
detected, e.g., by immunoblotting with antibodies raised against the prostate cancer protein. 
Methods of immunoblotting are well known to those of ordinary skill in the art. 

In another preferred method, antibodies to the prostate cancer protein find use
in in situ imaging techniques, e.g., in histology (e.g., Methods in Cell Biology: Antibodies in
Cell Biology, volume 37 (Asai, ed. 1993)). In this method cells are contacted with from one 
to many antibodies to the prostate cancer protein(s). Following washing to remove
non-specific antibody binding, the presence of the antibody or antibodies is detected. In one
embodiment the antibody is detected by incubating with a secondary antibody that contains a
detectable label. In another method the primary antibody to the prostate cancer protein(s) 
contains a detectable label, e.g. an enzyme marker that can act on a substrate. In another 
preferred embodiment each one of multiple primary antibodies contains a distinct and 
detectable label. This method finds particular use in simultaneous screening for a plurality of
prostate cancer proteins. As will be appreciated by one of ordinary skill in the art, many
other histological imaging techniques are also provided by the invention.

In a preferred embodiment the label is detected in a fluorometer which has the
ability to detect and distinguish emissions of different wavelengths. In addition, a 
fluorescence activated cell sorter (FACS) can be used in the method. 

In another preferred embodiment, antibodies find use in diagnosing prostate 
cancer from blood, serum, plasma, stool, and other samples. Such samples, therefore, are
useful as samples to be probed or tested for the presence of prostate cancer proteins. 
Antibodies can be used to detect a prostate cancer protein by previously described 
immunoassay techniques including ELISA, immunoblotting (western blotting), 
immunoprecipitation, BIACORE technology and the like. Conversely, the presence of 
antibodies may indicate an immune response against an endogenous prostate cancer protein.

In a preferred embodiment, in situ hybridization of labeled prostate cancer 
nucleic acid probes to tissue arrays is done. For example, arrays of tissue samples, including 
prostate cancer tissue and/or normal tissue, are made. In situ hybridization (see, e.g., 
Ausubel, supra) is then performed. When comparing the fingerprints between an individual 
and a standard, the skilled artisan can make a diagnosis, a prognosis, or a prediction based on
the findings. It is further understood that the genes which indicate the diagnosis may differ 
from those which indicate the prognosis and molecular profiling of the condition of the cells 
may lead to distinctions between responsive or refractory conditions or may be predictive of 
outcomes. 

In a preferred embodiment, the prostate cancer proteins, antibodies, nucleic
acids, modified proteins and cells containing prostate cancer sequences are used in prognosis
assays. As above, gene expression profiles can be generated that correlate to prostate cancer,
in terms of long term prognosis. Again, this may be done on either a protein or gene level,
with the use of genes being preferred. As above, prostate cancer probes may be attached to
biochips for the detection and quantification of prostate cancer sequences in a tissue or
patient. The assays proceed as outlined above for diagnosis. PCR methods may provide more
sensitive and accurate quantification.

Assays for therapeutic compounds 

In a preferred embodiment members of the proteins, nucleic acids, and
antibodies as described herein are used in drug screening assays. The prostate cancer
proteins, antibodies, nucleic acids, modified proteins and cells containing prostate cancer
sequences are used in drug screening assays or by evaluating the effect of drug candidates on
a "gene expression profile" or expression profile of polypeptides. In a preferred embodiment,
the expression profiles are used, preferably in conjunction with high throughput screening
techniques to allow monitoring for expression profile genes after treatment with a candidate
agent (e.g., Zlokarnik et al., Science 279:84-8 (1998); Heid, Genome Res. 6:986-94 (1996)).

In a preferred embodiment, the prostate cancer proteins, antibodies, nucleic 
acids, modified proteins and cells containing the native or modified prostate cancer proteins 
are used in screening assays. That is, the present invention provides novel methods for
screening for compositions which modulate the prostate cancer phenotype or an identified
physiological function of a prostate cancer protein. As above, this can be done on an
individual gene level or by evaluating the effect of drug candidates on a "gene expression
profile". In a preferred embodiment, the expression profiles are used, preferably in
conjunction with high throughput screening techniques to allow monitoring for expression
profile genes after treatment with a candidate agent, see Zlokarnik, supra.

Having identified the differentially expressed genes herein, a variety of assays 
may be executed. In a preferred embodiment, assays may be run on an individual gene or 
protein level. That is, having identified a particular gene as up regulated in prostate cancer, 
test compounds can be screened for the ability to modulate gene expression or for binding to
the prostate cancer protein. "Modulation" thus includes both an increase and a decrease in
gene expression. The preferred amount of modulation will depend on the original change of
the gene expression in normal tissue versus tissue undergoing prostate cancer, with changes of
at least 10%, preferably 50%, more preferably 100-300%, and in some embodiments
300-1000% or greater. Thus, if a gene exhibits a 4-fold increase in prostate cancer tissue
compared to normal tissue, a decrease of about four-fold is often desired; similarly, a 10-fold
decrease in prostate cancer tissue compared to normal tissue often provides a target value of a
10-fold increase in expression to be induced by the test compound.
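
By way of illustration only, the relationship between an observed change in expression and the
target modulation described above can be sketched as follows. The function name and the use of
a simple reciprocal are assumptions made for this sketch; they are not prescribed by the text.

```python
def target_fold_change(cancer_vs_normal: float) -> float:
    """Return the fold change a test compound would need to induce to
    bring expression back to the normal level.

    cancer_vs_normal is the (hypothetical) ratio of expression in prostate
    cancer tissue to expression in normal tissue and must be positive.
    A result below 1 means a decrease is desired; above 1, an increase.
    """
    if cancer_vs_normal <= 0:
        raise ValueError("expression ratio must be positive")
    return 1.0 / cancer_vs_normal


# A gene up regulated 4-fold calls for roughly a four-fold decrease.
print(target_fold_change(4.0))
# A gene down regulated 10-fold (ratio 0.1) calls for roughly a 10-fold increase.
print(target_fold_change(0.1))
```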

The amount of gene expression may be monitored using nucleic acid probes
and the quantification of gene expression levels, or, alternatively, the gene product itself can
be monitored, e.g., through the use of antibodies to the prostate cancer protein and standard
immunoassays. Proteomics and separation techniques may also allow quantification of
expression.

In a preferred embodiment, the gene expression or protein levels of a number
of entities, i.e., an expression profile, are monitored simultaneously. Such profiles will
typically involve a plurality of those entities described herein.

In this embodiment, the prostate cancer nucleic acid probes are attached to
biochips as outlined herein for the detection and quantification of prostate cancer sequences
in a particular cell. Alternatively, PCR may be used. Thus, a series, e.g., of microtiter plates,
may be used with dispensed primers in desired wells. A PCR reaction can then be performed
and analyzed for each well.
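
As a minimal sketch of dispensing primers into desired wells, the following assigns primer
pairs to wells of a plate, row by row. The 8 x 12 plate geometry, the well naming (A1, A2, ...)
and the primer names are assumptions introduced for illustration only.

```python
from string import ascii_uppercase


def plate_layout(primer_pairs, rows=8, cols=12):
    """Map well identifiers (A1, A2, ...) to primer pairs, filling row by row.

    The default 8 x 12 geometry corresponds to a 96-well plate and is an
    assumption of this sketch.
    """
    wells = [f"{ascii_uppercase[r]}{c}" for r in range(rows) for c in range(1, cols + 1)]
    if len(primer_pairs) > len(wells):
        raise ValueError("more primer pairs than wells on the plate")
    return dict(zip(wells, primer_pairs))


# Hypothetical primer pair names, one per well.
layout = plate_layout(["gene_A", "gene_B", "gene_C"])
for well, primers in layout.items():
    print(well, primers)
```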

Expression monitoring can be performed to identify compounds that modify 
the expression of one or more prostate cancer-associated sequences, e.g., a polynucleotide 
sequence set out in Tables 1-16. Generally, in a preferred embodiment, a test modulator is 
added to the cells prior to analysis. Moreover, screens are also provided to identify agents 
that modulate prostate cancer, modulate prostate cancer proteins, bind to a prostate cancer
protein, or interfere with the binding of a prostate cancer protein and an antibody or other 
binding partner. 

The term "test compound" or "drug candidate" or "modulator" or grammatical
equivalents as used herein describes any molecule, e.g., protein, oligopeptide, small organic
molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or
indirectly alter the prostate cancer phenotype or the expression of a prostate cancer sequence,
e.g., a nucleic acid or protein sequence. In preferred embodiments, modulators alter
expression profiles, or expression profile nucleic acids or proteins provided herein. In one
embodiment, the modulator suppresses a prostate cancer phenotype, e.g., to a normal tissue
fingerprint. In another embodiment, a modulator induces a prostate cancer phenotype.
Generally, a plurality of assay mixtures are run in parallel with different agent concentrations
to obtain a differential response to the various concentrations. Typically, one of these
concentrations serves as a negative control, i.e., at zero concentration or below the level of
detection.
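
For illustration, the differential response across agent concentrations can be expressed
relative to the negative control. The sketch below assumes the control is recorded at exactly
zero concentration and that readings are arbitrary signal units; the numbers are invented.

```python
def relative_response(readings):
    """Express each reading as a ratio to the zero-concentration control.

    readings maps agent concentration to measured signal; the key 0.0 is
    assumed to hold the negative control.
    """
    if 0.0 not in readings:
        raise ValueError("a zero-concentration negative control is required")
    baseline = readings[0.0]
    if baseline == 0:
        raise ValueError("control reading must be non-zero")
    return {conc: value / baseline for conc, value in sorted(readings.items())}


# Hypothetical readings from assay mixtures run in parallel.
readings = {100.0: 50.0, 0.0: 200.0, 10.0: 100.0, 1.0: 150.0}
for conc, ratio in relative_response(readings).items():
    print(f"{conc:>6} -> {ratio:.2f} of control")
```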

Drug candidates encompass numerous chemical classes, though typically they
are organic molecules, preferably small organic compounds having a molecular weight of
more than 100 and less than about 2,500 daltons. Preferred small molecules are less than
2000, or less than 1500 or less than 1000 or less than 500 D. Candidate agents comprise
functional groups necessary for structural interaction with proteins, particularly hydrogen
bonding, and typically include at least an amine, carbonyl, hydroxyl or carboxyl group,
preferably at least two of the functional chemical groups. The candidate agents often
comprise cyclical carbon or heterocyclic structures and/or aromatic or polyaromatic
structures substituted with one or more of the above functional groups. Candidate agents are
also found among biomolecules including peptides, saccharides, fatty acids, steroids, purines,
pyrimidines, derivatives, structural analogs or combinations thereof. Particularly preferred
are peptides.

In one aspect, a modulator will neutralize the effect of a prostate cancer 
protein. By "neutralize" is meant that the activity of a protein, and the consequent effect
on the cell, is inhibited or blocked.

In certain embodiments, combinatorial libraries of potential modulators will be 
screened for an ability to bind to a prostate cancer polypeptide or to modulate activity.
Conventionally, new chemical entities with useful properties are generated by identifying a
chemical compound (called a "lead compound") with some desirable property or activity,
e.g., inhibiting activity, creating variants of the lead compound, and evaluating the property
and activity of those variant compounds. Often, high throughput screening (HTS) methods
are employed for such an analysis.

In one preferred embodiment, high throughput screening methods involve 
providing a library containing a large number of potential therapeutic compounds (candidate 
compounds). Such "combinatorial chemical libraries" are then screened in one or more 
assays to identify those library members (particular chemical species or subclasses) that 
display a desired characteristic activity. The compounds thus identified can serve as
conventional "lead compounds" or can themselves be used as potential or actual therapeutics.

A combinatorial chemical library is a collection of diverse chemical 
compounds generated by either chemical synthesis or biological synthesis by combining a 
number of chemical "building blocks" such as reagents. For example, a linear combinatorial 
chemical library, such as a polypeptide (e.g., mutein) library, is formed by combining a set of
chemical building blocks called amino acids in every possible way for a given compound
length (i.e., the number of amino acids in a polypeptide compound). Millions of chemical
compounds can be synthesized through such combinatorial mixing of chemical building
blocks (Gallop et al., J. Med. Chem. 37(9):1233-1251 (1994)).
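
The scale of such a linear library can be illustrated with a short sketch. It assumes the
building blocks are the 20 standard amino acids written in one-letter code, so that a library
of length n contains 20^n members; the function names are hypothetical.

```python
from itertools import product

# The 20 standard amino acids in one-letter code (an assumption of this sketch).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def library_size(length, alphabet=AMINO_ACIDS):
    """Number of distinct linear compounds of the given length."""
    return len(alphabet) ** length


def enumerate_library(length, alphabet=AMINO_ACIDS):
    """Lazily yield every possible sequence of the given length."""
    for residues in product(alphabet, repeat=length):
        yield "".join(residues)


print(library_size(2))  # 400 dipeptides
print(library_size(5))  # 3200000 pentapeptides, i.e. millions of compounds
print(next(enumerate_library(3)))  # first tripeptide in enumeration order: AAA
```

Enumeration is lazy because the full set grows exponentially with compound length.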

Preparation and screening of combinatorial chemical libraries is well known to
those of skill in the art. Such combinatorial chemical libraries include, but are not limited to,
peptide libraries (see, e.g., U.S. Patent No. 5,010,175, Furka, Pept. Prot. Res. 37:487-493
(1991), Houghton et al., Nature 354:84-88 (1991)), peptoids (PCT Publication No. WO
91/19735), encoded peptides (PCT Publication WO 93/20242), random bio-oligomers (PCT
Publication WO 92/00091), benzodiazepines (U.S. Pat. No. 5,288,514), diversomers such as
hydantoins, benzodiazepines and dipeptides (Hobbs et al., Proc. Nat. Acad. Sci. USA
90:6909-6913 (1993)), vinylogous polypeptides (Hagihara et al., J. Amer. Chem. Soc.
114:6568 (1992)), nonpeptidal peptidomimetics with a Beta-D-Glucose scaffolding
(Hirschmann et al., J. Amer. Chem. Soc. 114:9217-9218 (1992)), analogous organic syntheses
of small compound libraries (Chen et al., J. Amer. Chem. Soc. 116:2661 (1994)),
oligocarbamates (Cho et al., Science 261:1303 (1993)), and/or peptidyl phosphonates
(Campbell et al., J. Org. Chem. 59:658 (1994)). See, generally, Gordon et al., J. Med. Chem.
37:1385 (1994), nucleic acid libraries (see, e.g., Stratagene, Corp.), peptide nucleic acid
libraries (see, e.g., U.S. Patent 5,539,083), antibody libraries (see, e.g., Vaughn et al., Nature
Biotechnology 14(3):309-314 (1996), and PCT/US96/10287), carbohydrate libraries (see,
e.g., Liang et al., Science 274:1520-1522 (1996), and U.S. Patent No. 5,593,853), and small
organic molecule libraries (see, e.g., benzodiazepines, Baum, C&EN, Jan 18, page 33 (1993);
isoprenoids, U.S. Patent No. 5,569,588; thiazolidinones and metathiazanones, U.S. Patent No.
5,549,974; pyrrolidines, U.S. Patent Nos. 5,525,735 and 5,519,134; morpholino compounds,
U.S. Patent No. 5,506,337; benzodiazepines, U.S. Patent No. 5,288,514; and the like).

Devices for the preparation of combinatorial libraries are commercially
available (see, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY, Symphony,
Rainin, Woburn, MA, 433A Applied Biosystems, Foster City, CA, 9050 Plus, Millipore, 
Bedford, MA). 

A number of well known robotic systems have also been developed for 
solution phase chemistries. These systems include automated workstations like the
automated synthesis apparatus developed by Takeda Chemical Industries, LTD. (Osaka,
Japan) and many robotic systems utilizing robotic arms (Zymate II, Zymark Corporation,
Hopkinton, Mass.; Orca, Hewlett-Packard, Palo Alto, Calif.), which mimic the manual
synthetic operations performed by a chemist. Any of the above devices is suitable for use
with the present invention. The nature and implementation of modifications to these devices
(if any) so that they can operate as discussed herein will be apparent to persons skilled in the
relevant art. In addition, numerous combinatorial libraries are themselves commercially
available (see, e.g., ComGenex, Princeton, N.J., Asinex, Moscow, RU, Tripos, Inc., St. Louis,
MO, ChemStar, Ltd, Moscow, RU, 3D Pharmaceuticals, Exton, PA, Martek Biosciences,
Columbia, MD, etc.).

The assays to identify modulators are amenable to high throughput screening.
Preferred assays thus detect enhancement or inhibition of prostate cancer gene transcription,
inhibition or enhancement of polypeptide expression, and inhibition or enhancement of 
polypeptide activity. 

High throughput assays for the presence, absence, quantification, or other
properties of particular nucleic acids or protein products are well known to those of skill in
the art. Binding assays and reporter gene assays are similarly well known. Thus,
e.g., U.S. Patent No. 5,559,410 discloses high throughput screening methods for proteins,
U.S. Patent No. 5,585,639 discloses high throughput screening methods for nucleic acid
binding (i.e., in arrays), while U.S. Patent Nos. 5,576,220 and 5,541,061 disclose high
throughput methods of screening for ligand/antibody binding.

In addition, high throughput screening systems are commercially available
(see, e.g., Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman
Instruments, Inc., Fullerton, CA; Precision Systems, Inc., Natick, MA, etc.). These systems
typically automate entire procedures, including all sample and reagent pipetting, liquid
dispensing, timed incubations, and final readings of the microplate in detector(s) appropriate
for the assay. These configurable systems provide high throughput and rapid start up as well
as a high degree of flexibility and customization. The manufacturers of such systems provide
detailed protocols for various high throughput systems. Thus, e.g., Zymark Corp. provides
technical bulletins describing screening systems for detecting the modulation of gene
transcription, ligand binding, and the like.

In one embodiment, modulators are proteins, often naturally occurring
proteins or fragments of naturally occurring proteins. Thus, e.g., cellular extracts containing
proteins, or random or directed digests of proteinaceous cellular extracts, may be used. In
this way libraries of proteins may be made for screening in the methods of the invention.
Particularly preferred in this embodiment are libraries of bacterial, fungal, viral, and
mammalian proteins, with the latter being preferred, and human proteins being especially
preferred. Particularly useful test compounds will be directed to the class of proteins to which
the target belongs, e.g., substrates for enzymes or ligands and receptors.

In a preferred embodiment, modulators are peptides of from about 5 to about
30 amino acids, with from about 5 to about 20 amino acids being preferred, and from about 7
to about 15 being particularly preferred. The peptides may be digests of naturally occurring
proteins as is outlined above, random peptides, or "biased" random peptides. By
"randomized" or grammatical equivalents herein is meant that each nucleic acid and peptide
consists of essentially random nucleotides and amino acids, respectively. Since generally
these random peptides (or nucleic acids, discussed below) are chemically synthesized, they
may incorporate any nucleotide or amino acid at any position. The synthetic process can be
designed to generate randomized proteins or nucleic acids, to allow the formation of all or
most of the possible combinations over the length of the sequence, thus forming a library of
randomized candidate bioactive proteinaceous agents.

In one embodiment, the library is fully randomized, with no sequence
preferences or constants at any position. In a preferred embodiment, the library is biased.
That is, some positions within the sequence are either held constant, or are selected from a
limited number of possibilities. For example, in a preferred embodiment, the nucleotides or
amino acid residues are randomized within a defined class, e.g., of hydrophobic amino acids,
hydrophilic residues, sterically biased (either small or large) residues, towards the creation of
nucleic acid binding domains, the creation of cysteines, for cross-linking, prolines for SH-3
domains, serines, threonines, tyrosines or histidines for phosphorylation sites, etc., or to
purines, etc.
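
A biased library of this kind can be sketched in silico as follows. The sketch assumes each
position is described by the set of residues allowed there: a single letter holds the position
constant, while a longer string restricts randomization to a class. The hydrophobic grouping
and the example pattern are assumptions chosen for illustration, not taken from the text.

```python
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter code
HYDROPHOBIC = "AVILMFW"               # one common grouping; classifications vary


def biased_peptide(allowed_per_position, rng):
    """Draw one peptide, choosing each residue from the set allowed at that position."""
    return "".join(rng.choice(allowed) for allowed in allowed_per_position)


# Hypothetical pattern: cysteines held constant at both ends (e.g., for
# cross-linking), two hydrophobic positions, two fully randomized positions.
pattern = ["C", AMINO_ACIDS, HYDROPHOBIC, HYDROPHOBIC, AMINO_ACIDS, "C"]
rng = random.Random(0)  # seeded only so that the sketch is repeatable
library = [biased_peptide(pattern, rng) for _ in range(5)]
print(library)
```

A fully randomized library corresponds to the special case in which every position allows all
residues.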

Modulators of prostate cancer can also be nucleic acids, as defined below. As 
described above generally for proteins, nucleic acid modulating agents may be naturally
occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids. For
example, digests of procaryotic or eucaryotic genomes may be used as is outlined above for
proteins.

In certain embodiments, the activity of a prostate cancer-associated protein is 
down-regulated, or entirely inhibited, by the use of an antisense polynucleotide, i.e., a nucleic
acid complementary to, and which can preferably hybridize specifically to, a coding mRNA
nucleic acid sequence, e.g., a prostate cancer protein mRNA, or a subsequence thereof. 
Binding of the antisense polynucleotide to the mRNA reduces the translation and/or stability 
of the mRNA. 
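
By way of example only, the sequence of an antisense RNA complementary to an mRNA
subsequence is its reverse complement. The sketch below assumes an unmodified RNA
alphabet (A, C, G, U); the input sequence is invented and is not a sequence of the invention.

```python
# Watson-Crick pairing for RNA: A-U and G-C.
_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def antisense_rna(mrna):
    """Return the reverse complement of an mRNA subsequence, written 5' to 3'."""
    mrna = mrna.upper()
    unexpected = set(mrna) - set("ACGU")
    if unexpected:
        raise ValueError(f"unexpected characters: {sorted(unexpected)}")
    return mrna.translate(_COMPLEMENT)[::-1]


print(antisense_rna("AUGGCC"))  # GGCCAU
```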

In the context of this invention, antisense polynucleotides can comprise
naturally-occurring nucleotides, or synthetic species formed from naturally-occurring
subunits or their close homologs. Antisense polynucleotides may also have altered sugar
moieties or inter-sugar linkages. Exemplary among these are the phosphorothioate and other
sulfur containing species which are known for use in the art. Analogs are comprehended by
this invention so long as they function effectively to hybridize with the prostate cancer
protein mRNA. See, e.g., Isis Pharmaceuticals, Carlsbad, CA; Sequitor, Inc., Natick, MA.

Such antisense polynucleotides can readily be synthesized using recombinant 
means, or can be synthesized in vitro. Equipment for such synthesis is sold by several 
vendors, including Applied Biosystems. The preparation of other oligonucleotides such as 
phosphorothioates and alkylated derivatives is also well known to those of skill in the art. 

Antisense molecules as used herein include antisense or sense
oligonucleotides. Sense oligonucleotides can, e.g., be employed to block transcription by
binding to the anti-sense strand. The antisense and sense oligonucleotides comprise a
single-stranded nucleic acid sequence (either RNA or DNA) capable of binding to target mRNA
(sense) or DNA (antisense) sequences for prostate cancer molecules. Antisense or sense
oligonucleotides, according to the present invention, comprise a fragment generally at least
about 14 nucleotides, preferably from about 14 to 30 nucleotides. The ability to derive an
antisense or a sense oligonucleotide, based upon a cDNA sequence encoding a given protein
is described in, e.g., Stein & Cohen (Cancer Res. 48:2659 (1988)) and van der Krol et al.
(BioTechniques 6:958 (1988)).

In addition to antisense polynucleotides, ribozymes can be used to target and
inhibit transcription of prostate cancer-associated nucleotide sequences. A ribozyme is an
RNA molecule that catalytically cleaves other RNA molecules. Different kinds of ribozymes
have been described, including group I ribozymes, hammerhead ribozymes, hairpin
ribozymes, RNase P, and axhead ribozymes (see, e.g., Castanotto et al., Adv. in
Pharmacology 25:289-317 (1994) for a general review of the properties of different
ribozymes).

The general features of hairpin ribozymes are described, e.g., in Hampel et al.,
Nucl. Acids Res. 18:299-304 (1990); European Patent Publication No. 0 360 257; U.S. Patent
No. 5,254,678. Methods of preparing them are well known to those of skill in the art (see, e.g.,
WO 94/26877; Ojwang et al., Proc. Natl. Acad. Sci. USA 90:6340-6344 (1993); Yamada et
al., Human Gene Therapy 1:39-45 (1994); Leavitt et al., Proc. Natl. Acad. Sci. USA
92:699-703 (1995); Leavitt et al., Human Gene Therapy 5:1151-120 (1994); and Yamada et al.,
Virology 205:121-126 (1994)).

Polynucleotide modulators of prostate cancer may be introduced into a cell
containing the target nucleotide sequence by formation of a conjugate with a ligand binding
molecule, as described in WO 91/04753. Suitable ligand binding molecules include, but are
not limited to, cell surface receptors, growth factors, other cytokines, or other ligands that
bind to cell surface receptors. Preferably, conjugation of the ligand binding molecule does
not substantially interfere with the ability of the ligand binding molecule to bind to its
corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide
or its conjugated version into the cell. Alternatively, a polynucleotide modulator of prostate
cancer may be introduced into a cell containing the target nucleic acid sequence, e.g., by
formation of a polynucleotide-lipid complex, as described in WO 90/10448. It is
understood that antisense molecules or knock out and knock in models may also be
used in screening assays as discussed above, in addition to methods of treatment.

As noted above, gene expression monitoring is conveniently used to test
candidate modulators (e.g., protein, nucleic acid or small molecule). After the candidate agent
has been added and the cells allowed to incubate for some period of time, the sample
containing a target sequence to be analyzed is added to the biochip. If required, the target
sequence is prepared using known techniques. For example, the sample may be treated to
lyse the cells, using known lysis buffers, electroporation, etc., with purification and/or
amplification such as PCR performed as appropriate. For example, an in vitro transcription
with labels covalently attached to the nucleotides is performed. Generally, the nucleic acids
are labeled with biotin-FITC or PE, or with cy3 or cy5.

In a preferred embodiment, the target sequence is labeled with, e.g., a
fluorescent, a chemiluminescent, a chemical, or a radioactive signal, to provide a means of
detecting the target sequence's specific binding to a probe. The label also can be an enzyme,
such as alkaline phosphatase or horseradish peroxidase, which when provided with an
appropriate substrate produces a product that can be detected. Alternatively, the label can be
a labeled compound or small molecule, such as an enzyme inhibitor, that binds but is not
catalyzed or altered by the enzyme. The label also can be a moiety or compound, such as an
epitope tag or biotin which specifically binds to streptavidin. For the example of biotin, the
streptavidin is labeled as described above, thereby providing a detectable signal for the
bound target sequence. Unbound labeled streptavidin is typically removed prior to analysis.

As will be appreciated by those in the art, these assays can be direct
hybridization assays or can comprise "sandwich assays", which include the use of multiple
probes, as is generally outlined in U.S. Patent Nos. 5,681,702, 5,597,909, 5,545,730,
5,594,117, 5,591,584, 5,571,670, 5,580,731, 5,571,670, 5,591,584, 5,624,802, 5,635,352,
5,594,118, 5,359,100, 5,124,246 and 5,681,697, all of which are hereby incorporated by
reference. In this embodiment, in general, the target nucleic acid is prepared as outlined
above, and then added to the biochip comprising a plurality of nucleic acid probes, under
conditions that allow the formation of a hybridization complex.

A variety of hybridization conditions may be used in the present invention,
including high, moderate and low stringency conditions as outlined above. The assays are
generally run under stringency conditions which allow formation of the label probe
hybridization complex only in the presence of target. Stringency can be controlled by
altering a step parameter that is a thermodynamic variable, including, but not limited to,
temperature, formamide concentration, salt concentration, chaotropic salt concentration, pH,
organic solvent concentration, etc.

These parameters may also be used to control non-specific binding, as is
generally outlined in U.S. Patent No. 5,681,697. Thus it may be desirable to perform certain
steps at higher stringency conditions to reduce non-specific binding.

The reactions outlined herein may be accomplished in a variety of ways.
Components of the reaction may be added simultaneously, or sequentially, in different orders,
with preferred embodiments outlined below. In addition, the reaction may include a variety
of other reagents. These include salts, buffers, neutral proteins, e.g., albumin, detergents, etc.,
which may be used to facilitate optimal hybridization and detection, and/or reduce
non-specific or background interactions. Reagents that otherwise improve the efficiency of the
assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may also be
used as appropriate, depending on the sample preparation methods and purity of the target.

The assay data are analyzed to determine the expression levels, and changes in
expression levels as between states, of individual genes, forming a gene expression profile.
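
One common way to summarize changes in expression level between two states is the log2
ratio per gene. The following is a minimal sketch under stated assumptions: expression values
are non-negative signal intensities, a pseudocount of 1 is added to avoid division by zero, and
the gene names and values are invented. The text does not prescribe this particular measure.

```python
import math


def expression_profile(normal, tumor, pseudocount=1.0):
    """Return log2(tumor / normal) for every gene measured in both states.

    Positive values indicate higher expression in the tumor state,
    negative values lower expression.
    """
    return {
        gene: math.log2((tumor[gene] + pseudocount) / (normal[gene] + pseudocount))
        for gene in sorted(normal.keys() & tumor.keys())
    }


# Hypothetical signal intensities for two genes in two states.
normal = {"gene_A": 7.0, "gene_B": 15.0}
tumor = {"gene_A": 31.0, "gene_B": 3.0}
for gene, change in expression_profile(normal, tumor).items():
    print(gene, round(change, 2))
```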

Screens are performed to identify modulators of the prostate cancer
phenotype. In one embodiment, screening is performed to identify modulators that can
induce or suppress a particular expression profile, thus preferably generating the associated
phenotype. In another embodiment, e.g., for diagnostic applications, having identified
differentially expressed genes important in a particular state, screens can be performed to
identify modulators that alter expression of individual genes. In another embodiment,
screening is performed to identify modulators that alter a biological function of the
expression product of a differentially expressed gene. Again, having identified the
importance of a gene in a particular state, screens are performed to identify agents that bind
and/or modulate the biological activity of the gene product.

In addition, screens can be done for genes that are induced in response to a
candidate agent. After identifying a modulator based upon its ability to suppress a prostate
cancer expression pattern leading to a normal expression pattern, or to modulate a single
prostate cancer gene expression profile so as to mimic the expression of the gene from
normal tissue, a screen as described above can be performed to identify genes that are
specifically modulated in response to the agent. Comparing expression profiles between
normal tissue and agent treated prostate cancer tissue reveals genes that are not expressed in
normal tissue or prostate cancer tissue, but are expressed in agent treated tissue. These
agent-specific sequences can be identified and used by methods described herein for prostate
cancer genes or proteins. In particular these sequences and the proteins they encode find use in
marking or identifying agent treated cells. In addition, antibodies can be raised against the
agent induced proteins and used to target novel therapeutics to the treated prostate cancer
tissue sample.

Thus, in one embodiment, a test compound is administered to a population of
prostate cancer cells that have an associated prostate cancer expression profile. By
"administration" or "contacting" herein is meant that the candidate agent is added to the cells
in such a manner as to allow the agent to act upon the cell, whether by uptake and
intracellular action, or by action at the cell surface. In some embodiments, nucleic acid
encoding a proteinaceous candidate agent (i.e., a peptide) may be put into a viral construct
such as an adenoviral or retroviral construct, and added to the cell, such that expression of
the peptide agent is accomplished, e.g., PCT US97/01019. Regulatable gene therapy systems
can also be used.

Once the test compound has been administered to the cells, the cells can be 
washed if desired and are allowed to incubate under preferably physiological conditions for 
some period of time. The cells are then harvested and a new gene expression profile is
generated, as outlined herein.

Thus, e.g., prostate cancer tissue may be screened for agents that modulate, 
e.g., induce or suppress the prostate cancer phenotype. A change in at least one gene, 
preferably many, of the expression profile indicates that the agent has an effect on prostate 
cancer activity. By defining such a signature for the prostate cancer phenotype, screens for
new drugs that alter the phenotype can be devised. With this approach, the drug target need
not be known and need not be represented in the original expression screening platform, nor 
does the level of transcript for the target protein need to change. 
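
The comparison of profiles before and after treatment can be sketched as follows. The
two-fold threshold, the function name and the example values are assumptions of this sketch;
the text itself requires only a change in at least one gene, preferably many.

```python
def changed_genes(before, after, min_fold=2.0):
    """List genes whose expression changed by at least min_fold in either direction.

    before and after map gene names to positive expression values; genes
    with non-positive values are skipped because the ratio is undefined.
    """
    changed = []
    for gene in sorted(before.keys() & after.keys()):
        if before[gene] <= 0 or after[gene] <= 0:
            continue
        ratio = after[gene] / before[gene]
        if ratio >= min_fold or ratio <= 1.0 / min_fold:
            changed.append(gene)
    return changed


# Hypothetical profiles of prostate cancer cells before and after treatment.
before = {"gene_A": 10, "gene_B": 10, "gene_C": 10}
after = {"gene_A": 40, "gene_B": 11, "gene_C": 2}
hits = changed_genes(before, after)
print(hits)  # ['gene_A', 'gene_C']
print("agent has an effect" if hits else "no change detected")
```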

In a preferred embodiment, as outlined above, screens may be done on
individual genes and gene products (proteins). That is, having identified a particular
differentially expressed gene as important in a particular state, screening of modulators of
either the expression of the gene or the gene product itself can be done. The gene products of
differentially expressed genes are sometimes referred to herein as "prostate cancer proteins"
or a "prostate cancer modulatory protein". The prostate cancer modulatory protein may be a
fragment, or alternatively, the full length protein corresponding to the fragment encoded by
the nucleic acids of Tables 1-16. Preferably, the prostate cancer modulatory protein is a
fragment. In a preferred embodiment, the prostate cancer amino acid sequence which is used
to determine sequence identity or similarity is encoded by a nucleic acid of Tables 1-16. In
another embodiment, the sequences are naturally occurring allelic variants of a protein encoded
by a nucleic acid of Tables 1-16. In another embodiment, the sequences are sequence variants
as further described herein.

Preferably, the prostate cancer modulatory protein is a fragment of
approximately 14 to 24 amino acids long. More preferably the fragment is a soluble
fragment. Preferably, the fragment includes a non-transmembrane region. In a preferred
embodiment, the fragment has an N-terminal Cys to aid in solubility. In one embodiment, the
C-terminus of the fragment is kept as a free acid and the N-terminus is a free amine to aid in
coupling, i.e., to cysteine.

In one embodiment the prostate cancer proteins are conjugated to an 
immunogenic agent as discussed herein. In one embodiment the prostate cancer protein is 
conjugated to BSA. 

Measurements of prostate cancer polypeptide activity, or of prostate cancer or
the prostate cancer phenotype can be performed using a variety of assays. For example, the
effects of the test compounds upon the function of the prostate cancer polypeptides can be
measured by examining parameters described above. A suitable physiological change that
affects activity can be used to assess the influence of a test compound on the polypeptides of
this invention. When the functional consequences are determined using intact cells or
animals, one can also measure a variety of effects such as, in the case of prostate cancer
associated with tumors, tumor growth, tumor metastasis, neovascularization, hormone
release, transcriptional changes to both known and uncharacterized genetic markers (e.g.,
northern blots), changes in cell metabolism such as cell growth or pH changes, and changes
in intracellular second messengers such as cGMP. In the assays of the invention, mammalian
prostate cancer polypeptide is typically used, e.g., mouse, preferably human.

Assays to identify compounds with modulating activity can be performed in
vitro. For example, a prostate cancer polypeptide is first contacted with a potential modulator
and incubated for a suitable amount of time, e.g., from 0.5 to 48 hours. In one embodiment,
the prostate cancer polypeptide levels are determined in vitro by measuring the level of
protein or mRNA. The level of protein is measured using immunoassays such as western
blotting, ELISA and the like with an antibody that selectively binds to the prostate cancer
polypeptide or a fragment thereof. For measurement of mRNA, amplification, e.g., using
PCR or LCR, or hybridization assays, e.g., northern hybridization, RNase protection, or dot
blotting, are preferred. The level of protein or mRNA is detected using directly or indirectly
labeled detection agents, e.g., fluorescently or radioactively labeled nucleic acids,
radioactively or enzymatically labeled antibodies, and the like, as described herein.

Alternatively, a reporter gene system can be devised using the prostate cancer
protein promoter operably linked to a reporter gene such as luciferase, green fluorescent
protein, CAT, or β-gal. The reporter construct is typically transfected into a cell. After
treatment with a potential modulator, the amount of reporter gene transcription, translation, or
activity is measured according to standard techniques known to those of skill in the art.

In a preferred embodiment, as outlined above, screens may be done on
individual genes and gene products (proteins). That is, having identified a particular
differentially expressed gene as important in a particular state, screening of modulators of the
expression of the gene or the gene product itself can be done. The gene products of
differentially expressed genes are sometimes referred to herein as "prostate cancer proteins."
The prostate cancer protein may be a fragment or, alternatively, the full-length protein
corresponding to a fragment shown herein.

In one embodiment, screening for modulators of expression of specific genes
is performed. Typically, the expression of only one or a few genes is evaluated. In another
embodiment, screens are designed to first find compounds that bind to differentially
expressed proteins. These compounds are then evaluated for the ability to modulate
differentially expressed activity. Moreover, once initial candidate compounds are identified,
variants can be further screened to better evaluate structure-activity relationships.

In a preferred embodiment, binding assays are done. In general, purified or
isolated gene product is used; that is, the gene products of one or more differentially
expressed nucleic acids are made. For example, antibodies are generated to the protein gene
products, and standard immunoassays are run to determine the amount of protein present.
Alternatively, cells comprising the prostate cancer proteins can be used in the assays.

Thus, in a preferred embodiment, the methods comprise combining a prostate
cancer protein and a candidate compound, and determining the binding of the compound to
the prostate cancer protein. Preferred embodiments utilize the human prostate cancer protein,
although other mammalian proteins may also be used, e.g., for the development of animal
models of human disease. In some embodiments, as outlined herein, variant or derivative
prostate cancer proteins may be used.

Generally, in a preferred embodiment of the methods herein, the prostate
cancer protein or the candidate agent is non-diffusably bound to an insoluble support having
isolated sample receiving areas (e.g., a microtiter plate, an array, etc.). The insoluble
supports may be made of any composition to which the compositions can be bound, that is readily
separated from soluble material, and that is otherwise compatible with the overall method of
screening. The surface of such supports may be solid or porous and of any convenient shape.
Examples of suitable insoluble supports include microtiter plates, arrays, membranes and
beads. These are typically made of glass, plastic (e.g., polystyrene), polysaccharides, nylon
or nitrocellulose, teflon™, etc. Microtiter plates and arrays are especially convenient because
a large number of assays can be carried out simultaneously, using small amounts of reagents
and samples. The particular manner of binding of the composition is not crucial so long as it
is compatible with the reagents and overall methods of the invention, maintains the activity of
the composition and is nondiffusable. Preferred methods of binding include the use of
antibodies (which do not sterically block either the ligand binding site or activation sequence
when the protein is bound to the support), direct binding to "sticky" or ionic supports,
chemical crosslinking, the synthesis of the protein or agent on the surface, etc. Following
binding of the protein or agent, excess unbound material is removed by washing. The sample
receiving areas may then be blocked through incubation with bovine serum albumin (BSA),
casein or other innocuous protein or other moiety.

In a preferred embodiment, the prostate cancer protein is bound to the support,
and a test compound is added to the assay. Alternatively, the candidate agent is bound to the
support and the prostate cancer protein is added. Novel binding agents include specific
antibodies, non-natural binding agents identified in screens of chemical libraries, peptide
analogs, etc. Of particular interest are screening assays for agents that have a low toxicity for
human cells. A wide variety of assays may be used for this purpose, including labeled in
vitro protein-protein binding assays, electrophoretic mobility shift assays, immunoassays for
protein binding, functional assays (phosphorylation assays, etc.) and the like.

The determination of the binding of the test modulating compound to the
prostate cancer protein may be done in a number of ways. In a preferred embodiment, the
compound is labeled, and binding determined directly, e.g., by attaching all or a portion of
the prostate cancer protein to a solid support, adding a labeled candidate agent (e.g., a
fluorescent label), washing off excess reagent, and determining whether the label is present
on the solid support. Various blocking and washing steps may be utilized as appropriate.

In some embodiments, only one of the components is labeled, e.g., the
proteins (or proteinaceous candidate compounds) can be labeled. Alternatively, more than
one component can be labeled with different labels, e.g., ¹²⁵I for the proteins and a fluorophore
for the compound. Proximity reagents, e.g., quenching or energy transfer reagents, are also
useful.

In one embodiment, the binding of the test compound is determined by
competitive binding assay. The competitor is a binding moiety known to bind to the target
molecule (i.e., a prostate cancer protein), such as an antibody, peptide, binding partner,
ligand, etc. Under certain circumstances, there may be competitive binding between the
compound and the binding moiety, with the binding moiety displacing the compound. In one
embodiment, the test compound is labeled. Either the compound, or the competitor, or both,
are added first to the protein for a time sufficient to allow binding, if present. Incubations may
be performed at a temperature which facilitates optimal activity, typically between 4°C and
40°C. Incubation periods are typically optimized, e.g., to facilitate rapid high throughput
screening. Typically between 0.1 and 1 hour will be sufficient. Excess reagent is generally
removed or washed away. The second component is then added, and the presence or absence
of the labeled component is followed, to indicate binding.

In a preferred embodiment, the competitor is added first, followed by the test
compound. Displacement of the competitor is an indication that the test compound is binding
to the prostate cancer protein and thus is capable of binding to, and potentially modulating,
the activity of the prostate cancer protein. In this embodiment, either component can be
labeled. Thus, e.g., if the competitor is labeled, the presence of label in the wash solution
indicates displacement by the agent. Alternatively, if the test compound is labeled, the
presence of the label on the support indicates displacement.
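
As a rough illustration of how displacement of a labeled competitor might be quantified, the following Python sketch computes the percentage of competitor signal lost from the support after a test compound is added. The function name, the background-subtraction step, and the example readings are illustrative assumptions and are not part of the methods described herein.

```python
def percent_displacement(signal_without_compound, signal_with_compound, background=0.0):
    """Percent of labeled competitor displaced from the support by a test compound.

    All arguments are label readings (e.g., counts or fluorescence units) taken
    from the support; `background` is the reading of a blank well (assumed).
    """
    bound_before = signal_without_compound - background
    bound_after = signal_with_compound - background
    if bound_before <= 0:
        raise ValueError("no specific competitor binding above background")
    displaced = (bound_before - bound_after) / bound_before * 100.0
    # Clamp to the 0-100 range so measurement noise cannot yield impossible values.
    return max(0.0, min(100.0, displaced))


# Hypothetical readings: 1000 units without compound, 400 with compound, 200 blank.
print(percent_displacement(1000, 400, background=200))
```

A high value would suggest that the test compound competes for the same site as the competitor; what counts as a meaningful threshold depends on the assay and is not specified here.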



In an alternative embodiment, the test compound is added first, with
incubation and washing, followed by the competitor. The absence of binding by the
competitor may indicate that the test compound is bound to the prostate cancer protein with a
higher affinity. Thus, if the test compound is labeled, the presence of the label on the
support, coupled with a lack of competitor binding, may indicate that the test compound is
capable of binding to the prostate cancer protein.

In a preferred embodiment, the methods comprise differential screening to
identify agents that are capable of modulating the activity of the prostate cancer proteins. In
this embodiment, the methods comprise combining a prostate cancer protein and a competitor
in a first sample. A second sample comprises a test compound, a prostate cancer protein, and
a competitor. The binding of the competitor is determined for both samples, and a change, or
difference in binding between the two samples indicates the presence of an agent capable of
binding to the prostate cancer protein and potentially modulating its activity. That is, if the
binding of the competitor is different in the second sample relative to the first sample, the
agent is capable of binding to the prostate cancer protein.

Alternatively, differential screening is used to identify drug candidates that
bind to the native prostate cancer protein, but cannot bind to modified prostate cancer
proteins. The structure of the prostate cancer protein may be modeled, and used in rational
drug design to synthesize agents that interact with the binding site. Drug candidates that affect the
activity of a prostate cancer protein are also identified by screening drugs for the ability to
either enhance or reduce the activity of the protein.

Positive controls and negative controls may be used in the assays. Preferably,
control and test samples are run in at least triplicate to obtain statistically significant
results. Incubation of all samples is for a time sufficient for the binding of the agent to the
protein. Following incubation, samples are washed free of non-specifically bound material
and the amount of bound, generally labeled, agent is determined. For example, where a
radiolabel is employed, the samples may be counted in a scintillation counter to determine the
amount of bound compound.
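
The handling of replicate counts and controls described above can be sketched in Python as follows. This is a minimal illustration only: the function names, the rule of subtracting the negative-control mean to estimate specific binding, and the example counts are assumptions introduced here, not a procedure specified herein.

```python
from statistics import mean, stdev


def summarize_replicates(counts):
    """Return (mean, sample standard deviation) for replicate counts."""
    if len(counts) < 3:
        raise ValueError("at least triplicate measurements are expected")
    return mean(counts), stdev(counts)


def specific_binding(test_counts, negative_control_counts):
    """Mean bound signal after subtracting the negative-control (non-specific) mean."""
    test_mean, _ = summarize_replicates(test_counts)
    negative_mean, _ = summarize_replicates(negative_control_counts)
    return test_mean - negative_mean


# Hypothetical scintillation counts (cpm) for triplicate wells.
test_wells = [1200, 1250, 1150]
negative_wells = [200, 210, 190]
print(summarize_replicates(negative_wells))
print(specific_binding(test_wells, negative_wells))
```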

A variety of other reagents may be included in the screening assays. These
include reagents like salts, neutral proteins, e.g., albumin, detergents, etc., which may be used
to facilitate optimal protein-protein binding and/or reduce non-specific or background
interactions. Also reagents that otherwise improve the efficiency of the assay, such as 
protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may be used. The mixture 
of components may be added in an order that provides for the requisite binding. 

In a preferred embodiment, the invention provides methods for screening for a
compound capable of modulating the activity of a prostate cancer protein. The methods
comprise adding a test compound, as defined above, to a cell comprising prostate cancer
proteins. Preferred cell types include almost any cell. The cells contain a recombinant
nucleic acid that encodes a prostate cancer protein. In a preferred embodiment, a library of
candidate agents is tested on a plurality of cells.

In one aspect, the assays are evaluated in the presence or absence of, or after previous
or subsequent exposure to, physiological signals, e.g., hormones, antibodies, peptides,
antigens, cytokines, growth factors, action potentials, pharmacological agents including
chemotherapeutics, radiation, carcinogens, or other cells (i.e., cell-cell contacts). In another
example, the determinations are made at different stages of the cell cycle.

In this way, compounds that modulate prostate cancer agents are identified.
Compounds with pharmacological activity are able to enhance or interfere with the activity of
the prostate cancer protein. Once identified, similar structures are evaluated to identify
critical structural features of the compound.

In one embodiment, a method of inhibiting prostate cancer cell division is
provided. The method comprises administration of a prostate cancer inhibitor. In another
embodiment, a method of inhibiting prostate cancer is provided. The method comprises
administration of a prostate cancer inhibitor. In a further embodiment, methods of treating
cells or individuals with prostate cancer are provided. The method comprises administration
of a prostate cancer inhibitor.

In one embodiment, a prostate cancer inhibitor is an antibody as discussed
above. In another embodiment, the prostate cancer inhibitor is an antisense molecule.

A variety of cell growth, proliferation, and metastasis assays are known to 
those of skill in the art, as described below. 

Soft agar growth or colony formation in suspension 

Normal cells require a solid substrate to attach and grow. When the cells are
transformed, they lose this phenotype and grow detached from the substrate. For example,
transformed cells can grow in stirred suspension culture or suspended in semi-solid media,
such as semi-solid or soft agar. The transformed cells, when transfected with tumor
suppressor genes, regenerate a normal phenotype and require a solid substrate to attach and
grow. Soft agar growth or colony formation in suspension assays can be used to identify
modulators of prostate cancer sequences, which when expressed in host cells, inhibit
abnormal cellular proliferation and transformation. A therapeutic compound would reduce or
eliminate the host cells' ability to grow in stirred suspension culture or suspended in
semi-solid media, such as semi-solid or soft agar.

Techniques for soft agar growth or colony formation in suspension assays are
described in Freshney, Culture of Animal Cells: A Manual of Basic Technique (3rd ed., 1994),
herein incorporated by reference. See also, the methods section of Garkavtsev et al. (1996),
supra, herein incorporated by reference.

Contact inhibition and density limitation of growth 

Normal cells typically grow in a flat and organized pattern in a petri dish until
they touch other cells. When the cells touch one another, they are contact inhibited and stop
growing. When cells are transformed, however, the cells are not contact inhibited and
continue to grow to high densities in disorganized foci. Thus, the transformed cells grow to a
higher saturation density than normal cells. This can be detected morphologically by the
formation of a disoriented monolayer of cells or rounded cells in foci within the regular
pattern of normal surrounding cells. Alternatively, labeling index with (³H)-thymidine at
saturation density can be used to measure density limitation of growth. See Freshney (1994),
supra. The transformed cells, when transfected with tumor suppressor genes, regenerate a
normal phenotype and become contact inhibited and would grow to a lower density.

In this assay, labeling index with (³H)-thymidine at saturation density is a
preferred method of measuring density limitation of growth. Transformed host cells are
transfected with a prostate cancer-associated sequence and are grown for 24 hours at
saturation density in non-limiting medium conditions. The percentage of cells labeling with
(³H)-thymidine is determined autoradiographically. See, Freshney (1994), supra.
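
The labeling index referred to above is simply the percentage of counted cells that carry label. A minimal Python sketch follows; the function name and the example cell counts are hypothetical and serve only to illustrate the arithmetic.

```python
def labeling_index(labeled_cells, total_cells):
    """Percentage of counted cells that show label on the autoradiograph."""
    if total_cells <= 0:
        raise ValueError("total_cells must be positive")
    if not 0 <= labeled_cells <= total_cells:
        raise ValueError("labeled_cells must be between 0 and total_cells")
    return 100.0 * labeled_cells / total_cells


# Hypothetical count: 45 labeled nuclei among 500 cells scored.
print(labeling_index(45, 500))
```

A lower labeling index at saturation density in transfected cells, relative to untransfected transformed cells, would be consistent with restored density limitation of growth.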

Growth factor or serum dependence

Transformed cells have a lower serum dependence than their normal
counterparts (see, e.g., Temin, J. Natl. Cancer Inst. 37:167-175 (1966); Eagle et al., J. Exp.
Med. 131:836-879 (1970); Freshney, supra). This is in part due to release of various growth
factors by the transformed cells. Growth factor or serum dependence of transformed host
cells can be compared with that of control.

Tumor specific marker levels

Tumor cells release larger amounts of certain factors (hereinafter "tumor
specific markers") than their normal counterparts. For example, plasminogen activator (PA)
is released from human glioma at a higher level than from normal brain cells (see, e.g.,
Gullino, Angiogenesis, tumor vascularization, and potential interference with tumor growth,
in Biological Responses in Cancer, pp. 178-184 (Mihich (ed.) 1985)). Similarly, tumor
angiogenesis factor (TAF) is released at a higher level in tumor cells than their normal
counterparts. See, e.g., Folkman, Angiogenesis and Cancer, Sem Cancer Biol. (1992).

Various techniques which measure the release of these factors are described in
Freshney (1994), supra. Also, see, Unkless et al., J. Biol. Chem. 249:4295-4305 (1974);
Strickland & Beers, J. Biol. Chem. 251:5694-5702 (1976); Whur et al., Br. J. Cancer 42:305-312
(1980); Gullino, Angiogenesis, tumor vascularization, and potential interference with
tumor growth, in Biological Responses in Cancer, pp. 178-184 (Mihich (ed.) 1985);
Freshney, Anticancer Res. 5:111-130 (1985).

Invasiveness into Matrigel 

The degree of invasiveness into Matrigel or some other extracellular matrix
constituent can be used as an assay to identify compounds that modulate prostate
cancer-associated sequences. Tumor cells exhibit a good correlation between malignancy and
invasiveness of cells into Matrigel or some other extracellular matrix constituent. In this
assay, tumorigenic cells are typically used as host cells. Expression of a tumor suppressor
gene in these host cells would decrease invasiveness of the host cells.

Techniques described in Freshney (1994), supra, can be used. Briefly, the
level of invasion of host cells can be measured by using filters coated with Matrigel or some
other extracellular matrix constituent. Penetration into the gel, or through to the distal side of
the filter, is rated as invasiveness, and rated histologically by number of cells and distance
moved, or by prelabeling the cells with ¹²⁵I and counting the radioactivity on the distal side of
the filter or bottom of the dish. See, e.g., Freshney (1984), supra.


Tumor growth in vivo 

Effects of prostate cancer-associated sequences on cell growth can be tested in
transgenic or immune-suppressed mice. Knock-out transgenic mice can be made, in which
the prostate cancer gene is disrupted or in which a prostate cancer gene is inserted.
Knock-out transgenic mice can be made by insertion of a marker gene or other heterologous gene
into the endogenous prostate cancer gene site in the mouse genome via homologous
recombination. Such mice can also be made by substituting the endogenous prostate cancer
gene with a mutated version of the prostate cancer gene, or by mutating the endogenous
prostate cancer gene, e.g., by exposure to carcinogens.

A DNA construct is introduced into the nuclei of embryonic stem cells. Cells
containing the newly engineered genetic lesion are injected into a host mouse embryo, which
is re-implanted into a recipient female. Some of these embryos develop into chimeric mice
that possess germ cells partially derived from the mutant cell line. Therefore, by breeding the
chimeric mice it is possible to obtain a new line of mice containing the introduced genetic
lesion (see, e.g., Capecchi et al., Science 244:1288 (1989)). Chimeric targeted mice can be
derived according to Hogan et al., Manipulating the Mouse Embryo: A Laboratory Manual,
Cold Spring Harbor Laboratory (1988) and Teratocarcinomas and Embryonic Stem Cells: A
Practical Approach, Robertson, ed., IRL Press, Washington, D.C., (1987).

Alternatively, various immune-suppressed or immune-deficient host animals
can be used. For example, a genetically athymic "nude" mouse (see, e.g., Giovanella et al., J.
Natl. Cancer Inst. 52:921 (1974)), a SCID mouse, a thymectomized mouse, or an irradiated
mouse (see, e.g., Bradley et al., Br. J. Cancer 38:263 (1978); Selby et al., Br. J. Cancer
41:52 (1980)) can be used as a host. Transplantable tumor cells (typically about 10⁶ cells)
injected into isogenic hosts will produce invasive tumors in a high proportion of cases, while
normal cells of similar origin will not. In hosts which developed invasive tumors, cells
expressing a prostate cancer-associated sequence are injected subcutaneously. After a
suitable length of time, preferably 4-8 weeks, tumor growth is measured (e.g., by volume or
by its two largest dimensions) and compared to the control. Tumors that have statistically
significant reduction (using, e.g., Student's t test) are said to have inhibited growth.
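
The comparison of tumor growth against control can be sketched in Python as follows. Two assumptions are made here that do not come from the text: tumor volume is approximated from two caliper dimensions with the commonly used formula V = L × W² / 2, and the two-sample Student's t statistic is computed with pooled variance. The function names and the example measurements are hypothetical.

```python
from math import sqrt
from statistics import mean, variance


def tumor_volume(length_mm, width_mm):
    """Approximate tumor volume (mm^3) from two dimensions, V = L * W^2 / 2 (assumed formula)."""
    return length_mm * width_mm ** 2 / 2.0


def student_t(sample_a, sample_b):
    """Two-sample Student's t statistic with pooled variance.

    Returns (t, degrees_of_freedom). Each sample needs at least two values,
    and the pooled variance must be non-zero.
    """
    n_a, n_b = len(sample_a), len(sample_b)
    df = n_a + n_b - 2
    pooled = ((n_a - 1) * variance(sample_a) + (n_b - 1) * variance(sample_b)) / df
    t = (mean(sample_a) - mean(sample_b)) / sqrt(pooled * (1.0 / n_a + 1.0 / n_b))
    return t, df


# Hypothetical caliper measurements (length, width) in mm for each group.
treated = [tumor_volume(l, w) for l, w in [(8, 5), (9, 6), (7, 5), (8, 6)]]
control = [tumor_volume(l, w) for l, w in [(12, 9), (13, 10), (11, 9), (12, 10)]]
t_stat, dof = student_t(treated, control)
print(t_stat, dof)
```

The resulting statistic would be compared with the critical value from a t table for the returned degrees of freedom at the chosen significance level.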

Methods of identifying variant prostate cancer-associated sequences

Without being bound by theory, expression of various prostate cancer
sequences is correlated with prostate cancer. Accordingly, disorders based on mutant or
variant prostate cancer genes may be determined. In one embodiment, the invention provides
methods for identifying cells containing variant prostate cancer genes, e.g., determining all or
part of the sequence of at least one endogenous prostate cancer gene in a cell. This may be
accomplished using any number of sequencing techniques. In a preferred embodiment, the
invention provides methods of identifying the prostate cancer genotype of an individual, e.g.,
determining all or part of the sequence of at least one prostate cancer gene of the individual.
This is generally done in at least one tissue of the individual, and may include the evaluation
of a number of tissues or different samples of the same tissue. The method may include
comparing the sequence of the sequenced prostate cancer gene to a known prostate cancer
gene, i.e., a wild-type gene.

The sequence of all or part of the prostate cancer gene can then be compared
to the sequence of a known prostate cancer gene to determine if any differences exist. This
can be done using any number of known homology programs, such as Bestfit, etc. In a
preferred embodiment, the presence of a difference in the sequence between the prostate
cancer gene of the patient and the known prostate cancer gene correlates with a disease state
or a propensity for a disease state, as outlined herein.
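
To illustrate the idea of comparing a sequenced gene against a known sequence, the following Python sketch lists position-by-position mismatches between two sequences that are assumed to be already aligned and of equal length. It is a simplified stand-in for illustration only, not a replacement for a homology program such as Bestfit; the function name and the example sequences are hypothetical.

```python
def sequence_differences(reference, sample):
    """List (1-based position, reference base, sample base) for each mismatch.

    Both sequences are assumed to be pre-aligned and of equal length;
    insertions and deletions are not handled by this sketch.
    """
    if len(reference) != len(sample):
        raise ValueError("sequences must be pre-aligned to equal length")
    return [
        (position, ref_base, sample_base)
        for position, (ref_base, sample_base) in enumerate(
            zip(reference.upper(), sample.upper()), start=1
        )
        if ref_base != sample_base
    ]


# Hypothetical wild-type and patient fragments differing at one position.
print(sequence_differences("ATGCCA", "ATGACA"))
```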

In a preferred embodiment, the prostate cancer genes are used as probes to
determine the number of copies of the prostate cancer gene in the genome.

In another preferred embodiment, the prostate cancer genes are used as probes
to determine the chromosomal localization of the prostate cancer genes. Information such as
chromosomal localization finds use in providing a diagnosis or prognosis, in particular when
chromosomal abnormalities such as translocations and the like are identified in the prostate
cancer gene locus.

Administration of pharmaceutical and vaccine compositions

In one embodiment, a therapeutically effective dose of a prostate cancer
protein or modulator thereof is administered to a patient. By "therapeutically effective dose"
herein is meant a dose that produces the effects for which it is administered. The exact dose will
depend on the purpose of the treatment, and will be ascertainable by one skilled in the art
using known techniques (e.g., Ansel et al., Pharmaceutical Dosage Forms and Drug
Delivery; Lieberman, Pharmaceutical Dosage Forms (vols. 1-3, 1992), Dekker, ISBN
0824770846, 082476918X, 0824712692, 0824716981; Lloyd, The Art, Science and
Technology of Pharmaceutical Compounding (1999); and Pickar, Dosage Calculations
(1999)). As is known in the art, adjustments for prostate cancer degradation, systemic versus
localized delivery, and rate of new protease synthesis, as well as the age, body weight,
general health, sex, diet, time of administration, drug interaction and the severity of the
condition may be necessary, and will be ascertainable with routine experimentation by those
skilled in the art. U.S. Patent Application No. 09/687,576, which further discloses the use of
compositions and methods of diagnosis and treatment in prostate cancer, is hereby expressly
incorporated by reference.

A "patient" for the purposes of the present invention includes both humans 
and other animals, particularly mammals. Thus the methods are applicable to both human
therapy and veterinary applications. In the preferred embodiment the patient is a mammal,
preferably a primate, and in the most preferred embodiment the patient is human.

The administration of the prostate cancer proteins and modulators thereof of
the present invention can be done in a variety of ways as discussed above, including, but not
limited to, orally, subcutaneously, intravenously, intranasally, transdermally,
intraperitoneally, intramuscularly, intrapulmonary, vaginally, rectally, or intraocularly. In
some instances, e.g., in the treatment of wounds and inflammation, the prostate cancer
proteins and modulators may be directly applied as a solution or spray.

The pharmaceutical compositions of the present invention comprise a prostate
cancer protein in a form suitable for administration to a patient. In the preferred embodiment,
the pharmaceutical compositions are in a water soluble form, such as being present as
pharmaceutically acceptable salts, which is meant to include both acid and base addition
salts. "Pharmaceutically acceptable acid addition salt" refers to those salts that retain the
biological effectiveness of the free bases and that are not biologically or otherwise
undesirable, formed with inorganic acids such as hydrochloric acid, hydrobromic acid,
sulfuric acid, nitric acid, phosphoric acid and the like, and organic acids such as acetic acid,
propionic acid, glycolic acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic
acid, fumaric acid, tartaric acid, citric acid, benzoic acid, cinnamic acid, mandelic acid,
methanesulfonic acid, ethanesulfonic acid, p-toluenesulfonic acid, salicylic acid and the like.
"Pharmaceutically acceptable base addition salts" include those derived from inorganic bases
such as sodium, potassium, lithium, ammonium, calcium, magnesium, iron, zinc, copper,
manganese, aluminum salts and the like. Particularly preferred are the ammonium,
potassium, sodium, calcium, and magnesium salts. Salts derived from pharmaceutically
acceptable organic non-toxic bases include salts of primary, secondary, and tertiary amines,
substituted amines including naturally occurring substituted amines, cyclic amines and basic
ion exchange resins, such as isopropylamine, trimethylamine, diethylamine, triethylamine,
tripropylamine, and ethanolamine.

The pharmaceutical compositions may also include one or more of the
following: carrier proteins such as serum albumin; buffers; fillers such as microcrystalline
cellulose, lactose, corn and other starches; binding agents; sweeteners and other flavoring
agents; coloring agents; and polyethylene glycol.

The pharmaceutical compositions can be administered in a variety of unit
dosage forms depending upon the method of administration. For example, unit dosage forms
suitable for oral administration include, but are not limited to, powder, tablets, pills, capsules
and lozenges. It is recognized that prostate cancer protein modulators (e.g., antibodies,
antisense constructs, ribozymes, small organic molecules, etc.), when administered orally,
should be protected from digestion. This is typically accomplished either by complexing the
molecule(s) with a composition to render it resistant to acidic and enzymatic hydrolysis, or by
packaging the molecule(s) in an appropriately resistant carrier, such as a liposome or a
protection barrier. Means of protecting agents from digestion are well known in the art.

The compositions for administration will commonly comprise a prostate
cancer protein modulator dissolved in a pharmaceutically acceptable carrier, preferably an
aqueous carrier. A variety of aqueous carriers can be used, e.g., buffered saline and the like.
These solutions are sterile and generally free of undesirable matter. These compositions may
be sterilized by conventional, well known sterilization techniques. The compositions may
contain pharmaceutically acceptable auxiliary substances as required to approximate
physiological conditions such as pH adjusting and buffering agents, toxicity adjusting agents
and the like, e.g., sodium acetate, sodium chloride, potassium chloride, calcium chloride,
sodium lactate and the like. The concentration of active agent in these formulations can vary
widely, and will be selected primarily based on fluid volumes, viscosities, body weight and
the like in accordance with the particular mode of administration selected and the patient's
needs (e.g., Remington's Pharmaceutical Science (15th ed., 1980) and Goodman & Gilman,
The Pharmacological Basis of Therapeutics (Hardman et al., eds., 1996)).

Thus, a typical pharmaceutical composition for intravenous administration
would be about 0.1 to 10 mg per patient per day. Dosages from 0.1 up to about 100 mg per
patient per day may be used, particularly when the drug is administered to a secluded site and
not into the blood stream, such as into a body cavity or into a lumen of an organ.
Substantially higher dosages are possible in topical administration. Actual methods for
preparing parenterally administrable compositions will be known or apparent to those skilled
in the art, e.g., Remington's Pharmaceutical Science and Goodman and Gilman, The
Pharmacological Basis of Therapeutics, supra.

The compositions containing modulators of prostate cancer proteins can be
administered for therapeutic or prophylactic treatments. In therapeutic applications,
compositions are administered to a patient suffering from a disease (e.g., a cancer) in an
amount sufficient to cure or at least partially arrest the disease and its complications. An
amount adequate to accomplish this is defined as a "therapeutically effective dose." Amounts
effective for this use will depend upon the severity of the disease and the general state of the
patient's health. Single or multiple administrations of the compositions may be administered
depending on the dosage and frequency as required and tolerated by the patient. In any event,
the composition should provide a sufficient quantity of the agents of this invention to
effectively treat the patient. An amount of modulator that is capable of preventing or slowing
the development of cancer in a mammal is referred to as a "prophylactically effective dose."
The particular dose required for a prophylactic treatment will depend upon the medical
condition and history of the mammal, the particular cancer being prevented, as well as other
factors such as age, weight, gender, administration route, efficiency, etc. Such prophylactic
treatments may be used, e.g., in a mammal who has previously had cancer to prevent a
recurrence of the cancer, or in a mammal who is suspected of having a significant likelihood
of developing cancer.

It will be appreciated that the present prostate cancer protein-modulating
compounds can be administered alone or in combination with additional prostate cancer
modulating compounds or with other therapeutic agents, e.g., other anti-cancer agents or
treatments.

In numerous embodiments, one or more nucleic acids, e.g., polynucleotides
comprising nucleic acid sequences set forth in Tables 1-16, such as antisense polynucleotides
or ribozymes, will be introduced into cells, in vitro or in vivo. The present invention provides
methods, reagents, vectors, and cells useful for expression of prostate cancer-associated
polypeptides and nucleic acids using in vitro (cell-free), ex vivo or in vivo (cell or
organism-based) recombinant expression systems.

The particular procedure used to introduce the nucleic acids into a host cell for
expression of a protein or nucleic acid is application specific. Many procedures for
introducing foreign nucleotide sequences into host cells may be used. These include the use
of calcium phosphate transfection, spheroplasts, electroporation, liposomes, microinjection,
plasmid vectors, viral vectors and any of the other well known methods for introducing cloned
genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell (see,
e.g., Berger & Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology
volume 152 (Berger), Ausubel et al., eds., Current Protocols (supplemented through 1999),
and Sambrook et al., Molecular Cloning - A Laboratory Manual (2nd ed., Vol. 1-3, 1989)).

In a preferred embodiment, prostate cancer proteins and modulators are
administered as therapeutic agents, and can be formulated as outlined above. Similarly,
prostate cancer genes (including both the full-length sequence, partial sequences, or
regulatory sequences of the prostate cancer coding regions) can be administered in a gene
therapy application. These prostate cancer genes can include antisense applications, either as
gene therapy (i.e., for incorporation into the genome) or as antisense compositions, as will be
appreciated by those in the art.

Prostate cancer polypeptides and polynucleotides can also be administered as
vaccine compositions to stimulate HTL, CTL and antibody responses. Such vaccine
compositions can include, e.g., lipidated peptides (see, e.g., Vitiello, A. et al., J. Clin. Invest.
95:341 (1995)), peptide compositions encapsulated in poly(DL-lactide-co-glycolide) ("PLG")
microspheres (see, e.g., Eldridge et al., Molec. Immunol. 28:287-294 (1991); Alonso et al.,
Vaccine 12:299-306 (1994); Jones et al., Vaccine 13:675-681 (1995)), peptide compositions
contained in immune stimulating complexes (ISCOMS) (see, e.g., Takahashi et al., Nature
344:873-875 (1990); Hu et al., Clin Exp Immunol 113:235-243 (1998)), multiple antigen
peptide systems (MAPs) (see, e.g., Tam, Proc. Natl. Acad. Sci. U.S.A. 85:5409-5413 (1988);
Tam, J. Immunol. Methods 196:17-32 (1996)), peptides formulated as multivalent peptides;
peptides for use in ballistic delivery systems, typically crystallized peptides, viral delivery
vectors (Perkus et al., In: Concepts in vaccine development (Kaufmann, ed., p. 379, 1996);
Chakrabarti et al., Nature 320:535 (1986); Hu et al., Nature 320:537 (1986); Kieny et al.,
AIDS Bio/Technology 4:790 (1986); Top et al., J. Infect. Dis. 124:148 (1971); Chanda et al.,
Virology 175:535 (1990)), particles of viral or synthetic origin (see, e.g., Kofler et al., J.
Immunol. Methods 192:25 (1996); Eldridge et al., Sem. Hematol. 30:16 (1993); Falo et al.,
Nature Med. 7:649 (1995)), adjuvants (Warren et al., Annu. Rev. Immunol. 4:369 (1986);
Gupta et al., Vaccine 11:293 (1993)), liposomes (Reddy et al., J. Immunol. 148:1585 (1992);
Rock, Immunol. Today 17:131 (1996)), or naked or particle absorbed cDNA (Ulmer et al.,
Science 259:1745 (1993); Robinson et al., Vaccine 11:957 (1993); Shiver et al., In: Concepts
in vaccine development (Kaufmann, ed., p. 423, 1996); Cease & Berzofsky, Annu. Rev.
Immunol. 12:923 (1994) and Eldridge et al., Sem. Hematol. 30:16 (1993)). Toxin-targeted
delivery technologies, also known as receptor mediated targeting, such as those of Avant
Immunotherapeutics, Inc. (Needham, Massachusetts) may also be used.

Vaccine compositions often include adjuvants. Many adjuvants contain a
substance designed to protect the antigen from rapid catabolism, such as aluminum hydroxide
or mineral oil, and a stimulator of immune responses, such as lipid A, Bordetella pertussis or
Mycobacterium tuberculosis derived proteins. Certain adjuvants are commercially available
as, e.g., Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit,
MI); Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline
Beecham, Philadelphia, PA); aluminum salts such as aluminum hydroxide gel (alum) or
aluminum phosphate; salts of calcium, iron or zinc; an insoluble suspension of acylated
tyrosine; acylated sugars; cationically or anionically derivatized polysaccharides;
polyphosphazenes; biodegradable microspheres; monophosphoryl lipid A and Quil A.
Cytokines, such as GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be
used as adjuvants.

Vaccines can be administered as nucleic acid compositions wherein DNA or
RNA encoding one or more of the polypeptides, or a fragment thereof, is administered to a
patient. This approach is described, for instance, in Wolff et al., Science 247:1465 (1990) as
well as U.S. Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,118; 5,736,524; 5,679,647;
WO 98/04720; and in more detail below. Examples of DNA-based delivery technologies
include "naked DNA", facilitated (bupivacaine, polymers, peptide-mediated) delivery,
cationic lipid complexes, and particle-mediated ("gene gun") or pressure-mediated delivery
(see, e.g., U.S. Patent No. 5,922,687).

For therapeutic or prophylactic immunization purposes, the peptides of the
invention can be expressed by viral or bacterial vectors. Examples of expression vectors
include attenuated viral hosts, such as vaccinia or fowlpox. This approach involves the use of
vaccinia virus, e.g., as a vector to express nucleotide sequences that encode prostate cancer
polypeptides or polypeptide fragments. Upon introduction into a host, the recombinant
vaccinia virus expresses the immunogenic peptide, and thereby elicits an immune response.
Vaccinia vectors and methods useful in immunization protocols are described in, e.g., U.S.
Patent No. 4,722,848. Another vector is BCG (Bacille Calmette Guerin). BCG vectors are
described in Stover et al., Nature 351:456-460 (1991). A wide variety of other vectors useful
for therapeutic administration or immunization, e.g., adeno and adeno-associated virus vectors,
retroviral vectors, Salmonella typhi vectors, detoxified anthrax toxin vectors, and the like,
will be apparent to those skilled in the art from the description herein (see, e.g., Shata et al.,
Mol Med Today 6:66-71 (2000); Shedlock et al., J Leukoc Biol 68:793-806 (2000); Hipp et
al., In Vivo 14:571-85 (2000)).

Methods for the use of genes as DNA vaccines are well known, and include
placing a prostate cancer gene or portion of a prostate cancer gene under the control of a
regulatable promoter or a tissue-specific promoter for expression in a prostate cancer patient.
The prostate cancer gene used for DNA vaccines can encode full-length prostate cancer
proteins, but more preferably encodes portions of the prostate cancer proteins including
peptides derived from the prostate cancer protein. In one embodiment, a patient is
immunized with a DNA vaccine comprising a plurality of nucleotide sequences derived from
a prostate cancer gene. For example, prostate cancer-associated genes or sequences encoding
subfragments of a prostate cancer protein are introduced into expression vectors and tested
for their immunogenicity in the context of Class I MHC and an ability to generate cytotoxic T
cell responses. This procedure provides for production of cytotoxic T cell responses against
cells which present antigen, including intracellular epitopes.

In a preferred embodiment, the DNA vaccines include a gene encoding an
adjuvant molecule with the DNA vaccine. Such adjuvant molecules include cytokines that
increase the immunogenic response to the prostate cancer polypeptide encoded by the DNA
vaccine. Additional or alternative adjuvants are available.

In another preferred embodiment, prostate cancer genes find use in generating
animal models of prostate cancer. When the prostate cancer gene identified is repressed or
diminished in cancer tissue, gene therapy technology, e.g., wherein antisense RNA is directed
to the prostate cancer gene, will also diminish or repress expression of the gene. Animal
models of prostate cancer find use in screening for modulators of a prostate cancer-associated
sequence or modulators of prostate cancer. Similarly, transgenic animal technology
including gene knockout technology, e.g., as a result of homologous recombination with an
appropriate gene targeting vector, will result in the absence or increased expression of the
prostate cancer protein. When desired, tissue-specific expression or knockout of the prostate
cancer protein may be necessary.

It is also possible that the prostate cancer protein is overexpressed in prostate
cancer. As such, transgenic animals can be generated that overexpress the prostate cancer
protein. Depending on the desired expression level, promoters of various strengths can be
employed to express the transgene. Also, the number of copies of the integrated transgene
can be determined and compared for a determination of the expression level of the transgene.
Animals generated by such methods find use as animal models of prostate cancer and are
additionally useful in screening for modulators to treat prostate cancer.

Kits for Use in Diagnostic and/or Prognostic Applications 

For use in diagnostic, research, and therapeutic applications suggested above,
kits are also provided by the invention. In the diagnostic and research applications such kits
may include any or all of the following: assay reagents, buffers, prostate cancer-specific
nucleic acids or antibodies, hybridization probes and/or primers, antisense polynucleotides,
ribozymes, dominant negative prostate cancer polypeptides or polynucleotides, small
molecule inhibitors of prostate cancer-associated sequences, etc. A therapeutic product may
include sterile saline or another pharmaceutically acceptable emulsion and suspension base.

In addition, the kits may include instructional materials containing directions
(i.e., protocols) for the practice of the methods of this invention. While the instructional
materials typically comprise written or printed materials, they are not limited to such. Any
medium capable of storing such instructions and communicating them to an end user is
contemplated by this invention. Such media include, but are not limited to, electronic storage
media (e.g., magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), and the
like. Such media may include addresses to internet sites that provide such instructional
materials.

The present invention also provides for kits for screening for modulators of
prostate cancer-associated sequences. Such kits can be prepared from readily available
materials and reagents. For example, such kits can comprise one or more of the following
materials: a prostate cancer-associated polypeptide or polynucleotide, reaction tubes, and
instructions for testing prostate cancer-associated activity. Optionally, the kit contains
biologically active prostate cancer protein. A wide variety of kits and components can be
prepared according to the present invention, depending upon the intended user of the kit and
the particular needs of the user. Diagnosis would typically involve evaluation of a plurality
of genes or products. The genes will be selected based on correlations with important
parameters in disease which may be identified in historical or outcome data.

EXAMPLES

Example 1: Tissue Preparation, Labeling Chips, and Fingerprints 

Purifying total RNA from tissue sample using TRIzol Reagent

The sample weight is first estimated. The tissue samples are homogenized in
1 ml of TRIzol per 50 mg of tissue using a homogenizer (e.g., Polytron 3100). The size of
the generator/probe used depends upon the sample amount. A generator that is too large for
the amount of tissue to be homogenized will cause a loss of sample and lower RNA yield. A
larger generator (e.g., 20 mm) is suitable for tissue samples weighing more than 0.6 g.
Tubes should not be overfilled. If the working volume is greater than 2 ml and no greater than
10 ml, a 15 ml polypropylene tube (Falcon 2059) is suitable for homogenization.

Tissues should be kept frozen until homogenized. The TRIzol is added
directly to the frozen tissue before homogenization. Following homogenization, the insoluble
material is removed from the homogenate by centrifugation at 7500 x g for 15 min. in a
Sorvall superspeed or 12,000 x g for 10 min. in an Eppendorf centrifuge at 4°C. The cleared
homogenate is then transferred to a new tube(s). Samples may be frozen and stored at -60 to
-70°C for at least one month; otherwise, the purification is continued.

The next process is phase separation. The homogenized samples are incubated
for 5 minutes at room temperature. Then, 0.2 ml of chloroform per 1 ml of TRIzol reagent is
added to the homogenization mixture. The tubes are securely capped and shaken vigorously
by hand (do not vortex) for 15 seconds. The samples are then incubated at room temperature for
2-3 minutes and next centrifuged at 6500 rpm in a Sorvall superspeed for 30 min. at 4°C.

The next process is RNA precipitation. The aqueous phase is transferred to a
fresh tube. The organic phase can be saved if isolation of DNA or protein is desired. Then
0.5 ml of isopropyl alcohol is added per 1 ml of TRIzol reagent used in the original
homogenization. Then, the tubes are securely capped and inverted to mix. The samples are
then incubated at room temperature for 10 minutes and centrifuged at 6500 rpm in a Sorvall for 20
min. at 4°C.

The RNA is then washed. The supernatant is poured off and the pellet washed
with cold 75% ethanol. 1 ml of 75% ethanol is used per 1 ml of the TRIzol reagent used in
the initial homogenization. The tubes are capped securely and inverted several times to
loosen the pellet without vortexing. They are next centrifuged at <8000 rpm (<7500 x g) for 5
minutes at 4°C.
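By way of illustration only, the reagent ratios recited above (1 ml TRIzol per 50 mg tissue; 0.2 ml chloroform, 0.5 ml isopropyl alcohol and 1 ml 75% ethanol per 1 ml TRIzol) can be scaled to a given tissue mass as sketched below. The function name `trizol_volumes` and the 250 mg example are assumptions introduced for this sketch and are not part of the protocol itself.

```python
def trizol_volumes(tissue_mg):
    """Return reagent volumes (ml) scaled to tissue mass (mg).

    Ratios are those stated in the text; the function itself is a
    hypothetical helper, not part of the described protocol.
    """
    if tissue_mg <= 0:
        raise ValueError("tissue mass must be positive")
    trizol_ml = tissue_mg / 50.0  # 1 ml TRIzol per 50 mg tissue
    return {
        "trizol_ml": trizol_ml,
        "chloroform_ml": 0.2 * trizol_ml,   # phase separation
        "isopropanol_ml": 0.5 * trizol_ml,  # RNA precipitation
        "ethanol_75_ml": 1.0 * trizol_ml,   # RNA wash
    }


if __name__ == "__main__":
    for name, ml in trizol_volumes(250).items():
        print(f"{name}: {ml:.2f}")
```

For a 250 mg sample this gives a 5 ml working volume, which falls in the range for which the 15 ml polypropylene tube mentioned above is described as suitable.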

The RNA wash is decanted. The pellet is carefully transferred to an
Eppendorf tube (sliding down the tube into the new tube by use of a pipet tip to help guide it
in if necessary). Tube sizes for precipitating the RNA depend on the working volumes.
Larger tubes may take too long to dry. The pellet is dried. The RNA is then resuspended in an
appropriate volume (e.g., 2-5 ug/ul) of DEPC H2O. The absorbance is then measured.

The poly A+ mRNA may next be purified from total RNA by other methods
such as Qiagen's RNeasy kit. The poly A+ mRNA is purified from total RNA by adding the
Oligotex suspension, which has been heated to 37°C and mixed prior to adding to the RNA.
The Elution Buffer is incubated at 70°C. If there is precipitate in the buffer, warm up the 2 x
Binding Buffer at 65°C. Then the total RNA is mixed with DEPC-treated water, 2 x Binding
Buffer, and Oligotex according to Table 2 on page 16 of the Oligotex Handbook and next
incubated for 3 minutes at 65°C and 10 minutes at room temperature.

The preparation is centrifuged for 2 minutes at 14,000 to 18,000 g, preferably
at a "soft setting." The supernatant is removed without disturbing the Oligotex pellet. A little bit
of solution can be left behind to reduce the loss of Oligotex. The supernatant is saved until
satisfactory binding and elution of poly A+ mRNA has been found.

Then, the preparation is gently resuspended in Wash Buffer OW2 and pipetted
onto the spin column and centrifuged at full speed (soft setting if possible) for 1 minute.

Next, the spin column is transferred to a new collection tube and gently
resuspended in Wash Buffer OW2 and centrifuged as described herein.

Then, the spin column is transferred to a new tube and eluted with 20 to 100 ul
of preheated (70°C) Elution Buffer. The Oligotex resin is gently resuspended by pipetting up
and down. The centrifugation is repeated as above and the elution repeated with fresh elution
buffer or first eluate to keep the elution volume low.

The absorbance is next read to determine the yield, using diluted Elution
Buffer as the blank.

Before proceeding with cDNA synthesis, the mRNA is precipitated, as
components left over from the Oligotex purification procedure or present in the Elution Buffer
will inhibit downstream enzymatic reactions of the mRNA.
0.4 vol. of 7.5 M NH4OAc + 2.5 vol. of cold 100% ethanol is added and the preparation
precipitated at -20°C for 1 hour to overnight (or 20-30 min. at -70°C), and centrifuged at
14,000-16,000 x g for 30 minutes at 4°C. Next, the pellet is washed with 0.5 ml of 80%
ethanol (-20°C) and then centrifuged at 14,000-16,000 x g for 5 minutes at room temperature.
The 80% ethanol wash is then repeated. The last bit of ethanol is then dried from the pellet
without use of a speed vacuum and the pellet is then resuspended in DEPC H2O at 1 ug/ul
concentration.
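As a hedged illustration, the precipitation volumes recited above can be computed from the starting sample volume. The sketch assumes that both the 0.4 vol. and the 2.5 vol. are taken relative to the starting sample volume (the text does not say otherwise), and the helper name `ethanol_precipitation` is hypothetical.

```python
def ethanol_precipitation(sample_ul):
    """Volumes (ul) for precipitation with 7.5 M NH4OAc and 100% ethanol.

    Assumption: both multipliers refer to the starting sample volume.
    """
    if sample_ul <= 0:
        raise ValueError("sample volume must be positive")
    acetate_ul = 0.4 * sample_ul   # 0.4 vol. of 7.5 M NH4OAc
    ethanol_ul = 2.5 * sample_ul   # 2.5 vol. of cold 100% ethanol
    total_ul = sample_ul + acetate_ul + ethanol_ul
    return {
        "nh4oac_ul": acetate_ul,
        "ethanol_ul": ethanol_ul,
        "total_ul": total_ul,
        # resulting NH4OAc molarity in the combined mixture
        "nh4oac_final_molar": 7.5 * acetate_ul / total_ul,
    }


if __name__ == "__main__":
    print(ethanol_precipitation(100))
```

Under this assumption, a 100 ul eluate receives 40 ul of NH4OAc and 250 ul of ethanol, for a combined volume of 390 ul.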

Alternatively, the RNA may be purified using other methods (e.g., Qiagen's RNeasy kit).

No more than 100 ug is added to the RNeasy column. The sample volume is
adjusted to 100 ul with RNase-free water. 350 ul Buffer RLT and then 250 ul ethanol
(100%) are added to the sample. The preparation is then mixed by pipetting and applied to an
RNeasy mini spin column for centrifugation (15 sec at >10,000 rpm). If yield is low, reapply
the flowthrough to the column and centrifuge again.

Then, transfer the column to a new 2 ml collection tube and add 500 ul Buffer
RPE and centrifuge for 15 sec at >10,000 rpm. The flowthrough is discarded. 500 ul Buffer
RPE is then added and the preparation is centrifuged for 15 sec at >10,000 rpm. The
flowthrough is discarded, and the column membrane dried by centrifuging for 2 min at
maximum speed. The column is transferred to a new 1.5 ml collection tube. 30-50 ul of
RNase-free water is applied directly onto the column membrane. The column is then centrifuged
for 1 min at >10,000 rpm and the elution step repeated.

The absorbance is then read to determine yield. If necessary, the material may 
be ethanol precipitated with ammonium acetate and 2.5X volume 100% ethanol. 

First Strand cDNA Synthesis

The first strand can be made using Gibco's "Superscript Choice System
for cDNA Synthesis" kit. The starting material is 5 ug of total RNA or 1 ug of polyA+
mRNA. For total RNA, 2 ul of Superscript RT is used; for polyA+ mRNA, 1 ul of
Superscript RT is used. The final volume of first strand synthesis mix is 20 ul. The RNA
should be in a volume no greater than 10 ul. The RNA is incubated with 1 ul of 100 pmol
T7-T24 oligo for 10 min at 70°C followed by addition on ice of 7 ul of: 4 ul 5X 1st Strand
Buffer, 2 ul of 0.1 M DTT, and 1 ul of 10 mM dNTP mix. The preparation is then incubated at
37°C for 2 min before addition of the Superscript RT followed by incubation at 37°C for 1
hour.
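The volume bookkeeping for the first strand reaction may be sketched as follows. The component volumes are those recited above; that any remaining volume up to 20 ul is made up with water is an assumption of this sketch (the text states only the final volume), and the names used are hypothetical.

```python
FINAL_VOLUME_UL = 20.0

# Fixed additions recited in the text (ul).
FIXED_COMPONENTS_UL = {
    "T7-T24 oligo": 1.0,
    "5X 1st Strand Buffer": 4.0,
    "0.1 M DTT": 2.0,
    "10 mM dNTP mix": 1.0,
}

# Superscript RT volume depends on the starting material.
RT_UL = {"total": 2.0, "polyA": 1.0}


def first_strand_mix(rna_ul, rna_type):
    """Return component volumes (ul) for a 20 ul first strand reaction."""
    if not 0 < rna_ul <= 10:
        raise ValueError("RNA volume must be above 0 and no greater than 10 ul")
    mix = {"RNA": rna_ul}
    mix.update(FIXED_COMPONENTS_UL)
    mix["Superscript RT"] = RT_UL[rna_type]
    # Assumption: the balance to the final volume is water.
    mix["water (assumed)"] = FINAL_VOLUME_UL - sum(mix.values())
    return mix


if __name__ == "__main__":
    print(first_strand_mix(8.0, "polyA"))
```

With total RNA supplied in the full 10 ul, the recited components already sum to 20 ul and no make-up volume remains.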

Second Strand Synthesis

For the second strand synthesis, place 1st strand reactions on ice and add: 91
ul DEPC H2O; 30 ul 5X 2nd Strand Buffer; 3 ul 10 mM dNTP mix; 1 ul 10 U/ul E. coli DNA
Ligase; 4 ul 10 U/ul E. coli DNA Polymerase; and 1 ul 2 U/ul RNase H. Mix and incubate 2
hours at 16°C. Add 2 ul T4 DNA Polymerase. Incubate 5 min at 16°C. Add 10 ul of 0.5 M
EDTA.
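As a check on the recited volumes, the following sketch totals the second strand reaction, assuming the 20 ul first strand reaction is carried over in full. The function name and the derived EDTA concentration are illustrative calculations, not values stated in the text.

```python
FIRST_STRAND_UL = 20.0  # assumed to be carried over in full

ADDITIONS_UL = {
    "DEPC H2O": 91.0,
    "5X 2nd Strand Buffer": 30.0,
    "10 mM dNTP mix": 3.0,
    "E. coli DNA Ligase (10 U/ul)": 1.0,
    "E. coli DNA Polymerase (10 U/ul)": 4.0,
    "RNase H (2 U/ul)": 1.0,
}


def second_strand_summary():
    """Total the reaction and derive buffer strength and final EDTA."""
    reaction_ul = FIRST_STRAND_UL + sum(ADDITIONS_UL.values())
    buffer_x = 5.0 * ADDITIONS_UL["5X 2nd Strand Buffer"] / reaction_ul
    after_t4_ul = reaction_ul + 2.0       # 2 ul T4 DNA Polymerase
    stopped_ul = after_t4_ul + 10.0       # 10 ul of 0.5 M EDTA
    edta_mm = 500.0 * 10.0 / stopped_ul   # 0.5 M = 500 mM
    return {
        "reaction_ul": reaction_ul,
        "buffer_x": buffer_x,
        "stopped_ul": stopped_ul,
        "edta_mm": edta_mm,
    }


if __name__ == "__main__":
    print(second_strand_summary())
```

The recited additions bring the reaction to 150 ul, at which the 5X buffer is diluted to 1X.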

Cleaning up cDNA 

The cDNA is purified using Phenol:Chloroform:Isoamyl Alcohol (25:24:1)
and Phase-Lock gel tubes. The PLG tubes are centrifuged for 30 sec at maximum speed.
The cDNA mix is then transferred to the PLG tube. An equal volume of
phenol:chloroform:isoamyl alcohol is then added, the preparation shaken vigorously (no
vortexing), and centrifuged for 5 minutes at maximum speed. The top aqueous solution is
transferred to a new tube and ethanol precipitated by adding 7.5X 5M NH4OAc and 2.5X
volume of 100% ethanol. Next, it is centrifuged immediately at room temperature for 20
min, maximum speed. The supernatant is removed, and the pellet washed 2X with cold
80% ethanol. As much ethanol wash as possible should be removed before air drying the
pellet and resuspending it in 3 ul RNase-free water.

In vitro Transcription (IVT) and labeling with biotin

In vitro Transcription (IVT) and labeling with biotin is performed as follows:
Pipet 1.5 ul of cDNA into a thin-wall PCR tube. Make NTP labeling mix by combining 2 ul
T7 10x ATP (75 mM) (Ambion); 2 ul T7 10x GTP (75 mM) (Ambion); 1.5 ul T7 10x CTP (75
mM) (Ambion); 1.5 ul T7 10x UTP (75 mM) (Ambion); 3.75 ul 10 mM Bio-11-UTP
(Boehringer-Mannheim/Roche or Enzo); 3.75 ul 10 mM Bio-16-CTP (Enzo); 2 ul 10x T7
transcription buffer (Ambion); and 2 ul 10x T7 enzyme mix (Ambion). The final volume is
20 ul. Incubate 6 hours at 37°C in a PCR machine. The RNA can be further cleaned.
Clean-up follows the previous instructions for RNeasy columns or Qiagen's RNeasy protocol
handbook. The cRNA often needs to be ethanol precipitated and resuspended in a volume
compatible with the fragmentation step.
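The recited IVT labeling mix can be checked arithmetically as sketched below: the component volumes are summed, and final nucleotide concentrations and the biotinylated fraction of CTP and UTP are derived. The data layout and function name are assumptions of this sketch; the derived concentrations are calculations from the recited volumes and stock concentrations, not values stated in the text.

```python
# (name, volume in ul, stock concentration in mM or None if not applicable)
IVT_COMPONENTS = [
    ("cDNA", 1.5, None),
    ("ATP", 2.0, 75.0),
    ("GTP", 2.0, 75.0),
    ("CTP", 1.5, 75.0),
    ("UTP", 1.5, 75.0),
    ("Bio-11-UTP", 3.75, 10.0),
    ("Bio-16-CTP", 3.75, 10.0),
    ("10x T7 transcription buffer", 2.0, None),
    ("10x T7 enzyme mix", 2.0, None),
]


def ivt_summary(components=IVT_COMPONENTS):
    """Return total volume, final mM per nucleotide, and biotin fractions."""
    total_ul = sum(vol for _, vol, _ in components)
    final_mm = {
        name: stock * vol / total_ul
        for name, vol, stock in components
        if stock is not None
    }
    biotin_ctp = final_mm["Bio-16-CTP"] / (final_mm["Bio-16-CTP"] + final_mm["CTP"])
    biotin_utp = final_mm["Bio-11-UTP"] / (final_mm["Bio-11-UTP"] + final_mm["UTP"])
    return total_ul, final_mm, biotin_ctp, biotin_utp


if __name__ == "__main__":
    total, conc, f_ctp, f_utp = ivt_summary()
    print(total, conc, f_ctp, f_utp)
```

The volumes sum to the stated 20 ul, and by this calculation one quarter of the CTP and of the UTP in the reaction is biotinylated.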

Fragmentation is performed as follows. 15 ug of labeled RNA is usually
fragmented. Try to minimize the fragmentation reaction volume; a 10 ul volume is
recommended but 20 ul is all right. Do not go higher than 20 ul because the magnesium in
the fragmentation buffer contributes to precipitation in the hybridization buffer. Fragment
RNA by incubation at 94°C for 35 minutes in 1 x Fragmentation buffer (5 x Fragmentation
buffer is 200 mM Tris-acetate, pH 8.1; 500 mM KOAc; 150 mM MgOAc). The labeled
RNA transcript can be analyzed before and after fragmentation. Samples can be heated to
65°C for 15 minutes and electrophoresed on 1% agarose/TBE gels to get an approximate idea
of the transcript size range.
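For illustration, the dilution of the 5 x Fragmentation buffer to 1 x can be computed for a chosen reaction volume as below. The 20 ul ceiling reflects the limit stated above; the function name and return layout are hypothetical.

```python
# Composition of the 5 x Fragmentation buffer as recited (mM).
STOCK_5X_MM = {"Tris-acetate": 200.0, "KOAc": 500.0, "MgOAc": 150.0}


def fragmentation_setup(reaction_ul):
    """Return (5x buffer ul, RNA-plus-water ul, final mM at 1x)."""
    if not 0 < reaction_ul <= 20:
        raise ValueError("reaction volume should be above 0 and at most 20 ul")
    buffer_ul = reaction_ul / 5.0  # 5x stock diluted to 1x
    final_mm = {name: mm / 5.0 for name, mm in STOCK_5X_MM.items()}
    return buffer_ul, reaction_ul - buffer_ul, final_mm


if __name__ == "__main__":
    print(fragmentation_setup(10.0))
```

At the recommended 10 ul reaction volume, 2 ul of 5 x buffer is used, leaving 8 ul for the labeled RNA and water.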

For hybridization, 200 ul (10 ug cRNA) of a hybridization mix is put on the
chip. If multiple hybridizations are to be done (such as cycling through a 5 chip set), then it
is recommended that an initial hybridization mix of 300 ul or more be made. The
hybridization mix is: fragmented labeled RNA (50 ng/ul final conc.); 50 pM 948-b control
oligo; 1.5 pM BioB; 5 pM BioC; 25 pM BioD; 100 pM CRE; 0.1 mg/ml herring sperm DNA;
0.5 mg/ml acetylated BSA; brought to 300 ul with 1x MES hyb buffer.
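The relation between hybridization mix volume and cRNA mass at the stated final concentration of 50 ng/ul can be sketched as follows; the function name is an assumption introduced for illustration.

```python
def crna_required_ug(mix_ul, final_ng_per_ul=50.0):
    """cRNA mass (ug) contained in a hybridization mix of the given volume."""
    if mix_ul <= 0:
        raise ValueError("mix volume must be positive")
    return mix_ul * final_ng_per_ul / 1000.0  # ng -> ug


if __name__ == "__main__":
    print(crna_required_ug(200))  # volume applied to one chip
    print(crna_required_ug(300))  # recommended initial mix volume
```

This is consistent with the figures above: 200 ul on the chip carries 10 ug of cRNA, and a 300 ul initial mix uses the 15 ug that is usually fragmented.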

The hybridization reaction is conducted with non-biotinylated IVT (purified
by RNeasy columns) (see Example 1 for steps from tissue to IVT). The following mixture is
prepared:

IVT antisense RNA; 4 ug:         ul
Random Hexamers (1 ug/ul):     4 ul
H2O:                             ul
Total:                        14 ul

Incubate the above 14 ul mixture at 70°C for 10 min.; then put on ice.

The reverse transcription procedure uses the following mixture:

0.1 M DTT:                     3 ul
50X dNTP mix:                0.6 ul
H2O:                         2.4 ul
Cy3 or Cy5 dUTP (1 mM):        3 ul
SS RT II (BRL):                1 ul

The above solution is added to the hybridization reaction and incubated for 30 min. at 42°C.
Then, 1 ul SSII is added and incubated for another hour before being placed on ice.

The 50X dNTP mix contains 25 mM of cold dATP, dCTP, and dGTP, 10 mM
of dTTP and is made by adding 25 ul each of 100 mM dATP, dCTP, and dGTP; 10 ul of
100 mM dTTP to 15 ul H2O.
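The composition of the 50X dNTP mix follows directly from the recited volumes of the 100 mM stocks, as the sketch below verifies. The function name and dictionary layout are hypothetical.

```python
STOCK_MM = 100.0  # each nucleotide stock is 100 mM

VOLUMES_UL = {"dATP": 25.0, "dCTP": 25.0, "dGTP": 25.0, "dTTP": 10.0, "H2O": 15.0}


def dntp_mix_concentrations(volumes=VOLUMES_UL):
    """Return (total ul, final mM of each nucleotide) for the 50X mix."""
    total_ul = sum(volumes.values())
    final_mm = {
        name: STOCK_MM * vol / total_ul
        for name, vol in volumes.items()
        if name != "H2O"
    }
    return total_ul, final_mm


if __name__ == "__main__":
    print(dntp_mix_concentrations())
```

The volumes total 100 ul, so each stock is diluted in proportion to its volume, giving the stated 25 mM dATP, dCTP and dGTP and 10 mM dTTP.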

RNA degradation is performed as follows. Add 86 ul H2O, 1.5 ul 1M NaOH/
2 mM EDTA and incubate at 65°C for 10 min. For U-Con 30, 500 ul TE/sample spin at 7000 g
for 10 min, save flow through for purification. For Qiagen purification, suspend U-Con
recovered material in 500 ul buffer PB and proceed using Qiagen protocol. For DNAse
digestion, add 1 ul of 1/100 dilution of DNAse/30 ul Rx and incubate at 37°C for 15 min.
Incubate for 5 min at 95°C to denature the DNAse.

Sample preparation

For sample preparation, add Cot-1 DNA, 10 ul; 50X dNTPs, 1 ul; 20X SSC,
2.3 ul; Na pyrophosphate, 7.5 ul; 10 mg/ml Herring sperm DNA, 1 ul of 1/10 dilution, to
21.8 ul final vol. Dry in speed vac. Resuspend in 15 ul H2O. Add 0.38 ul 10% SDS. Heat
95°C, 2 min and slow cool at room temp. for 20 min. Put on slide and hybridize overnight at
64°C. Washing after the hybridization: 3X SSC/0.03% SDS: 2 min., 37.5 ml 20X
SSC + 0.75 ml 10% SDS in 250 ml H2O; 1X SSC: 5 min., 12.5 ml 20X SSC in 250 ml H2O;
0.2X SSC: 5 min., 2.5 ml 20X SSC in 250 ml H2O. Dry slides and scan at appropriate PMT's
and channels.
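The nominal strengths of the three wash solutions can be recovered from the recited stock volumes, as sketched below. The sketch assumes that 250 ml is the final volume of each wash (the text says "in 250 ml H2O", which could also be read as the volume of water added); the function name is hypothetical.

```python
def wash_buffer(ssc_20x_ml, sds_10pct_ml=0.0, final_ml=250.0):
    """Return (SSC strength in X, SDS in percent) for a wash solution.

    Assumption: final_ml is the total volume after adding the stocks.
    """
    ssc_x = 20.0 * ssc_20x_ml / final_ml
    sds_pct = 10.0 * sds_10pct_ml / final_ml
    return ssc_x, sds_pct


if __name__ == "__main__":
    print(wash_buffer(37.5, 0.75))  # first wash
    print(wash_buffer(12.5))        # second wash
    print(wash_buffer(2.5))         # third wash
```

Under that assumption the recited volumes reproduce the named strengths of 3X SSC/0.03% SDS, 1X SSC and 0.2X SSC.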

Example 2: Taxol-resistant Xenograft Model of Human Prostate Cancer

Treatment regimens that include paclitaxel (Taxol; Bristol-Myers Squibb
Company, Princeton, NJ) have been particularly successful in treating hormone-refractory
prostate cancer in the phase II setting (Smith et al., Semin. Oncol. 26(1 Suppl 2):109-11
(1999)). However, many patients develop tumors which are initially, or later become,
resistant to taxol. To identify genes that may be involved with resistance to taxol, or are
regulated in response to taxol resistance, and therefore may be used to treat, or identify, taxol
resistance in patients, the following experiments were carried out.

The androgen-independent human cell line CWR22R was grown as a
xenograft in nude mice (Nagabhushan et al., Cancer Res. 56(13):3042-3046 (1996); Agus et
al., J. Natl. Cancer Inst. 91(21):1869-1876 (1999); Bubendorf et al., J. Natl. Cancer Inst.
91(20):1758-1764 (1999)). Initially, these xenograft tumors were sensitive to therapeutic
doses of taxol. The mice were treated continuously with sub-therapeutic doses, and the
tumors were allowed to grow for 3-4 weeks, before surgical removal of the tumors. The
tumor from an individual mouse was then minced, and a small portion was then injected into
a healthy nude mouse, establishing a second passage of the tumor. This mouse was then
treated continuously with the same sub-therapeutic dose of taxol. This process was repeated
14 times, and a portion of each generation of xenograft tumor was collected. There was
increasing resistance to therapeutic doses of taxol with each generation. By the end of the
process, the tumors were fully resistant to therapeutic doses of taxol. RNA from each
generation of tumor was then isolated, and individual mRNA species were quantified using a
custom Affymetrix GeneChip® oligonucleotide microarray, with probes to interrogate
approximately 35,000 unique mRNA transcripts. Genes were selected that showed a
statistically significant up-regulation, or down-regulation, during the subsequent generations
of increasingly taxol-resistant tumors. Only one gene was significantly up-regulated, whereas
24 genes were
down-regulated; these are presented in Table 10.

The gene sequences identified to be overexpressed in prostate cancer may be
used to identify coding regions from the public DNA database. The sequences may be used
to either identify genes that encode known proteins, or they may be used to predict the coding
regions from genomic DNA using exon prediction algorithms, such as FGENESH (Salamov
and Solovyev, 2000, Genome Res. 10:516-522). In addition, one of ordinary skill in the art
would understand how to obtain the UniGene cluster identification and sequence information
according to the exemplar accession numbers provided in Tables 1-16 (see,
http://www.ncbi.nlm.nih.gov/UniGene/).

TABLE 1: shows genes, including expression sequence tags, differentially expressed in
prostate tumor tissue compared to normal tissue as analyzed using the Affymetrix/Eos Hu01
GeneChip array. Shown are the relative amounts of each gene expressed in prostate tumor
samples and various normal tissue samples showing the highest expression of the gene.



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnlgenelD: Unigene number 

Unigene Title: Unigene gene title 

R1; Ratio of tumor to normal body tissue 
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115764 Hs.91011 AA421562 anterior gradient 2 (Xenopus laevis; sec 8.9 

119617 Hs.55999 W47380 ESTs 8.9 

100552 Hs.301946 HG2167-HT2237 Protein Kinase W31, Camp-Dependent B.9 

105627 Hs.23317 AA281245 ESTs 8.8 

5 101461 Hs.76422 M22430 phospholipase A2; group II A (platelets; 8.7 

131725 Hs.31146 AA456264 ESTs; Highly similar to (defline not ava 8.5 

124526 Hs.293185 NB2096 yz61c5.s1 Soares_multiple_sclerosis_2NbH 8.5 

118528 Hs.49397 N67889 ESTs 8.2 

133845 Hs.76704 T68510 ESTs 8.2 

10 133354 Hs.334762 AA055552 ESTs; Weakly similar to KIAA031 9 [H.sapi 8.1 

105912 Hs.20415 AA402000 ESTs; Weakly similar to GS3786 [H.sapien 8 

119018 Hs.278695 N95796 ESTs 8 

100394 Hs.66052 D84276 CD38 antigen (p45) 8 

114132 Hs.24192 Z38688 ESTs 7.9 

15 116786 Hs.301527 H25836 tumor necrosis factor (ligand) superfami 7.7 

106579 Hs.23023 M456135 ESTs 7.6 

128790 Hs.1O5700 AA291725 secreted frizzled-related protein 4 75 

114965 Hs.72472 AA250737 ESTs 7.4 

112033 Hs.22627 R43162 ESTs 7.1 

20 102398 U42359 Human N33 protein form 1 (N33) gene, exo 7 

101201 Hs.2256 L22524 matrix metalloproteinase 7 (matrilysin; 6.9 

109272 Hs.288462 AA195718 ESTs 6.9 

103145 Hs.169849 X66276 myosfn-binding protein C; slow-type 6.9 

101803 Hs.155691 M86546 pre-B-cell leukemia transcription factor 6.8 

25 120562 Hs.302267 AA2B0036 ESTs; Weakly similar to W01A6.C [C.elega 6.8 

109112 Hs.257924 AA1 69379 ESTs 6.8 

109795 Hs.326416 F10707 ESTs 6.7 

107532 Hs.173684 Z19643 ESTs; Weakly similar to (defline not ava 6.7 

130336 Hs.171995 X07730 kallikrein 3; (prostate specific antigen 6.6 

30 131425 Hs.26691 AA219134 ESTs 6.6 

120588 Hs.16193 AA281591 Homo sapiens mRN A; cDNA DKFZp586B21 1 (fr 6.6 

132902 Hs.59838 AA490969 ESTs 6.6 

125674 Hs.323378 W28078 H.sapiens mRNA for transmembrane protein 6.6 

133724 Hs.75746 U07919 aldehyde dehydrogenase 6 6.5 

35 130343 Hs.278628 AA490262 ESTs; Moderately similar to APXL gene pr 6.5 

120215 Hs.108787 Z41050 Homo sapiens Mcd4p homolog mRNA; complet 6.5 

129215 Hs.126085 AA176867 ESTs 6.5 

131881 Hs.3383 AA010163 upstream regulatory element binding prot 6.5 

133376 Hs.7232 T23670 ESTs 6.4 

105376 Hs.8768 AA236559 ESTs; Weakly similar to neuronal thread 6.4

104674 Hs.26289 AA009527 ESTs 6.4 

100727 Hs.334786 X07290 Human HF.12 gene mRNA 6.3 

130150 Hs.15113 AF000573 homogentisate 1;2-dioxygenase (homogenti 6.3 

121770 Hs.278428 AA421714 Homo sapiens mRNA for KIAA0896 protein; 6.3 

123475 Hs.250528 AA599267 ESTs; Weakly similar to ANKYRIN; BRAIN V 6.3

133061 Hs.296638 AB000584 prostate differentiation factor 6.3 

116429 Hs.279923 AA609710 ESTs; Weakly similar to similar to GTP-b 6.2

101233 Hs.878 L29008 sorbitol dehydrogenase 6.2 

104691 Hs.37744 AA011176 ESTs 6.2

127248 AA325029 EST27953 Cerebellum II Homo sapiens cDNA 6.2

127775 Hs.179902 H04106 ESTs; Weakly similar to (defline not ava 6.2 

105500 Hs.222399 AA256485 ESTs 6.1 

131463 Hs.2714 X74142 forkhead (Drosophila)-like 1 6.1

132116 Hs.40289 AA234767 ESTs 6 

130828 Hs.203213 AA053400 ESTs 5.9

115357 Hs.72988 AA281793 ESTs 5.8 

105496 Hs.301997 AA256323 ESTs 5.7 

116334 Hs.48948 AA491457 ESTs 5.7 

107968 Hs.61539 AA034020 ESTs 5.7 

120132 Hs.125019 Z38839 ESTs; Weakly similar to !!!! ALU SUBFAMI 5.6

106375 Hs.289072 AA443993 ESTs 5.6 

132550 Hs.170195 AA029597 bone morphogenetic protein 7 (osteogenic 5.6

124777 Hs.140237 R41933 ESTs; Weakly similar to neuronal thread 5.6 

100311 Hs.337616 D50640 phosphodiesterase 3B; cGMP-inhibited 5.6

101791 Hs.62354 M83822 Human beige-like protein (BGL) mRNA; par 5.5

117698 Hs.45107 N41002 ESTs 5.5 

132387 Hs.281434 R70914 heat shock 70kD protein 1 5.5 

122041 Hs.98732 AA431407 Homo sapiens Chromosome 16 BAC clone CIT 5.5 

133723 Hs.262476 AA088851 S-adenosylmethionine decarboxylase 1 5.5 




113938 W81598 ESTs 5.4

133015 Hs.246315 AA047036 ESTs 5.4

125745 Hs.75722 AI283493 ribophorin II 5.4

107295 Hs.80120 T34527 UDP-N-acetyl-alpha-D-galactosamine:polyp 5.4

108186 Hs.7780 AA056482 ESTs 5.3

100184 Hs.21223 D17408 calponin 1; basic; smooth muscle 5.3

104456 Hs.326392 N25110 Human guanine nucleotide exchange factor 5.3

104033 Hs.98944 AA365031 ESTs 5.3

110844 Hs.167531 N31952 ESTs; Weakly similar to (defline not ava 5.3

129056 Hs.108336 H70627 ESTs; Weakly similar to !!!! ALU SUBFAMI 5.3

102805 Hs.25351 U90304 iroquois-class homeodomain protein 5.3

133493 Hs.194339 AA284143 Homo sapiens chromosome 1 atrophin-1 rel 5.3

129184 Hs.109201 W26769 ESTs; Highly similar to (defline not ava 5.2

134158 Hs.79428 U15174 BCL2/adenovirus E1B 19kD-interacting pro 5.2

107240 Hs.159872 D59368 ESTs 5.2

104787 AA027317 ESTs; Weakly similar to !!!! ALU SUBFAMI 5.2

123527 Hs.108327 AA608679 damage-specific DNA binding protein 1 (1 5.2

116646 Hs.194228 F03048 ESTs; Moderately similar to !!!! ALU SUB 5.2

101448 Hs.195850 M21389 keratin 5 (epidermolysis bullosa simplex 5.1

116188 Hs.184598 AA464728 ESTs; Weakly similar to !!!! ALU SUBFAMI 5.1

126259 Hs.281428 Z21472 ESTs; Moderately similar to !!!! ALU SUB 5.1

105921 Hs.169119 AA402613 ESTs 5.1

103375 Hs.54416 X91868 sine oculis homeobox (Drosophila) homolo 5.1

128871 Hs.106778 AA400271 ESTs; Highly similar to (defline not ava 5.1

112681 Hs.148932 R87331 ESTs; Moderately similar to semaphorin V 5.1

105784 Hs.226434 AA350771 ESTs 5.1

116238 Hs.47144 AA479362 ESTs 5

102913 Hs.80342 X07696 keratin 15 5

103011 Hs.326035 X52541 early growth response 1 5

126023 H58881 yr36d09.r1 Soares fetal liver spleen 1NF 5

103709 Hs.13804 AA037316 ESTs 5

118981 Hs.39288 N93839 ESTs; Weakly similar to !!!! ALU SUBFAMI 5

134807 Hs.89732 X78932 zinc finger protein 273 5

100079 Hs.23311 AB002365 Human mRNA for KIAA0367 gene; partial cd 4.9

132047 Hs.3796 D83492 EphB5 4.9

132880 Hs.177537 AA444369 ESTs 4.9

124049 Hs.74519 F10523 primase; polypeptide 2A (58kD) 4.8

133330 Hs.71119 U42360 Human N33 mRNA; complete cds 4.8

104776 AA026349 ESTs 4.8

122593 Hs.128749 AA453310 Homo sapiens alpha-methylacyl-CoA racema 4.8

103912 Hs.143087 AA251078 ESTs 4.8

113961 Hs.26009 W86307 Homo sapiens mRNA for KIAA0860 protein; 4.8

105288 Hs.3585 AA233168 ESTs; Weakly similar to coded for by C. 4.8

135035 Hs.284186 H89575 ESTs 4.8

104144 Hs.183390 AA447439 ESTs; Weakly similar to ZINC FINGER PROT 4.8

129389 Hs.288126 AA621604 ESTs 4.8

125982 R98091 RAE1 (RNA export 1; S.pombe) homolog 4.8

125162 Hs.26243 W44682 ESTs 4.8

103023 Hs.117950 X53793 multifunctional polypeptide similar to S 4.7

129735 W80701 ESTs; Weakly similar to HERV-E envelope 4.7

104479 Hs.106390 N36040 ESTs 4.7

103731 AA070545 zm7c3.r1 Stratagene neuroepithelium (#93 4.7

126575 Hs.127602 W72416 ESTs 4.7

124578 Hs.231500 N68321 Human glucose transporter-like protein-l 4.7

130617 Hs.1674 M90516 glutamine-fructose-6-phosphate transamin 4.7

116752 Hs.91622 H06373 Homo sapiens clone 24456 mRNA sequence 4.7

100279 Hs.82007 D42084 Human mRNA for KIAA0094 gene; partial cd 4.7

126288 Hs.89576 AI479264 ESTs 4.7

131836 Hs.32990 AA610086 ESTs 4.7

106717 Hs.239489 AA465093 TIA1 cytotoxic granule-associated RNA-bi 4.7

114542 Hs.91011 AA055768 ESTs 4.6

103806 AA130614 zo1f2.r1 Stratagene neuroepithelium NT2R 4.6

130529 AA173238 small inducible cytokine A5 (RANTES) 4.6

115675 Hs.82065 AA406546 ESTs 4.6

111386 Hs.293798 N95326 ESTs 4.6

106503 Hs.29679 AA452411 ESTs 4.6

119943 Hs.14158 W86835 copine III 4.6

104459 Hs.100070 M91493 EST 4.6

100774 Hs.89603 HG371-HT1063 Mucin 1, Epithelial, Alt. Splice 6 4.6
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100652 Hs.142653 HG2825-HT2949 Ret Transforming Gene 4.6 

132015 Hs.3731 D11900 ESTs 4.6 

126086 H70975 yr73g01.r1 Soares fetal liver spleen 1NF 4.6

130888 Hs.173094 F03819 ESTs 4.6 

103390 Hs.20166 AA446964 Prostate stem cell antigen 4.6

126959 AA199853 ESTs; Moderately similar to !!!! ALU SUB 4.5

131584 Hs.29117 X91648 H.sapiens mRNA for pur alpha extended 3' 4.5

104838 Hs.20953 AA039481 ESTs 4.5 

125661 R50319 ESTs 4.5 

103171 Hs.234726 X68733 alpha-1-antichymotrypsin 4.5

103928 Hs.199160 AA280085 ESTs 4.5 

102899 Hs.75730 X06272 signal recognition particle receptor ('d 4.5

100892 Hs.180789 HG4557-HT4962 Small Nuclear Ribonucleoprotein U1, 1snr 4.5

106167 Hs.7956 AA425906 ESTs 4.5 

129404 Hs.317584 AA172056 ESTs 4.5

106990 Hs.24758 AA521354 ESTs 4.5 

132316 Hs.44566 U28831 Human protein immuno-reactive with anti- 4.4 

132056 Hs.38176 T89336 Homo sapiens mRNA for KIAA0606 protein; 4.4 

133718 Hs.198760 X15306 neurofilament; heavy polypeptide (200kD) 4.4 

101470 Hs.1846 M22898 tumor protein p53 (Li-Fraumeni syndrome) 4.4

131904 Hs.284296 AA143019 ESTs; Highly similar to surface 4 integr 4.4 

105804 Hs.22514 AA383142 ESTs 4.4 

122861 Hs.119394 AA464428 ESTs 4.4 

111336 Hs.29894 N79565 ESTs 4.4 

121944 Hs.98518 AA429278 ESTs 4.4

134401 Hs.211577 AA243746 ESTs; Highly similar to CG1 protein [H.s 4.4

126458 Hs.288969 AA815252 ESTs; Weakly similar to !!!! ALU SUBFAMI 4.4

133435 Hs.323966 T23983 ESTs; Moderately similar to !!!! ALU SUB 4.4

105178 Hs.21941 AA187490 ESTs 4.3 

127315 AA640834 nr27b06.r1 NCI_CGAP_Pr3 Homo sapiens cDN 4.3

132645 Hs.54424 X87870 H.sapiens mRNA for hepatocyte nuclear fa 4.3 

116162 Hs.282990 AA461487 ESTs; Weakly similar to F52C12.2 [C.eleg 4.3 

118040 Hs.47567 N52876 EST 4.3 

130008 Hs.278427 M31423 cerebellar degeneration-related protein 4.3 

126607 Hs.114688 W87424 ESTs 4.3

123061 Hs.105130 AA482030 EST 4.3 

109391 Hs.184245 AA219699 ESTs 4.3 

109175 AA180496 ESTs 4.3 

127003 Hs.173540 AA550806 ESTs; Weakly similar to (defline not ava 4.3

102547 Hs.46638 U57911 chromosome 11 open reading frame 8 4.3

134208 Hs.79993 U88871 peroxisomal biogenesis factor 7 4.3 

104258 Hs.5462 AF007216 solute carrier family 4; sodium bicarbon 4.3 

130759 Hs.18946 AA094720 ESTs; Weakly similar to (defline not ava 4.3

132160 Hs.295923 AA281770 seven in absentia (Drosophila) homolog 1 4.3 

135062 Hs.93872 AA174183 ESTs 4.3

126510 Hs.334762 R49702 ESTs; Weakly similar to KIAA0319 [H.sapi 4.2 

122055 Hs.98747 AA431732 EST 4.2 

133136 Hs.6574 AF007165 suppressin (nuclear deformed epidermal a 4.2 

109890 Hs.20843 H04649 ESTs 4.2 

133294 Hs.69997 R79723 H.sapiens mRNA for translin associated z 4.2

134436 Hs.83190 S80437 fatty acid synthase {3' region} [human, 4.2 

107375 Hs.251064 U88573 NBR2 4.2 
122223 Hs.27413 AA436158 ESTs 4.2
103044 Hs.248210 X55777 H.sapiens Mahlavu hepatocellular carcino 4.2 

120125 Hs.59815 W95362 EST 4.2

128969 Hs.283978 T65327 ESTs; Highly similar to (defline not ava 4.2

129637 Hs.1179 D90359 TATA box binding protein (TBP)-associate 4.2 

106566 AA455921 ESTs; Weakly similar to !!!! ALU SUBFAMI 4.2 

112605 Hs.29852 R79220 ESTs 4.2 

103364 Hs.279929 X90872 H.sapiens mRNA for gp25L2 protein 4.2

132811 Hs.57419 U25435 transcriptional repressor 4.2 

126570 Hs.326292 T79274 ESTs 4.2 

116298 Hs.94109 AA489046 ESTs 4.2 

103024 Hs.105938 X53961 lactotransferrin 4.1 

129133 Hs.108850 R56728 yg95c6.r1 Soares infant brain 1NIB Homo 4.1

133167 Hs.6641 N98707 kinesin family member 5C 4.1 

126871 Hs.14051 AA351779 ESTs 4.1 

132333 Hs.45032 AA192157 ESTs 4.1 

107376 Hs.327179 U90545 solute carrier family 17 (sodium phospha 4.1
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128517 Hs.100361 AA2B0617 


ESTs; Weakly similar to p60 katanin [H.s 




130555 Hs.116774 AA450324 ESTs 4.1

105765 Hs.24183 AA343514 4.1

126529 Hs.26369 AA133237 4.1

125928 Hs.181889 H29730 4.1

117280 Hs.172129 N22107 ESTs; Moderately similar to !!!! ALU SUB 4.1

100234 Hs.3085 D29677 KIAA0054 gene product 4.1

100959 Hs.118127 J00073 actin; alpha; cardiac muscle 4.1


107130 Hs.12913 AA620582 
105035 Hs.8859 AA128486 


ESTs; Weakly similar to (defline not ava 


4.1 


126735 Hs.226795 AA808949 glutathione S-transferase pi 4.1

113056 Hs.8036 T26471 ESTs; Moderately similar to !!!! ALU SUB

102460 Hs.211582 U48959 Homo sapiens myosin light chain kinase (

106968 Hs.26813 AA504631 ESTs; Weakly similar to (defline not ava 4

123107 Hs.104207 AA486071 ESTs

127256 Hs.267967 AA327550 ESTs; Weakly similar to !!!! ALU SUBFAMI 4

105329 Hs.22862 AA234561 4

115504 Hs.42736 AA291946 ESTs 4

120726 Hs.97293 AA293656

103576 Hs.94560 Z26317 desmoglein 2 4

127889 Hs.144941 AI147408 4

106394 Hs.25320 AA447223 ESTs 4

128046 AA873285 ESTs

103391 Hs.114366 X94453 pyrroline-5-carboxylate synthetase (glut

106448 Hs.27004 AA449455 ESTs 4

126513 Hs.86276 W27601 ESTs; Moderately similar to (defline not 4

129593 Hs.98314 AA487015 ESTs; Weakly similar to !!!! ALU SUBFAMI 3.9

110151 Hs.31608 H18836 ESTs 3.9

105344 Hs.8645 AA235303 ESTs 3.9

104791 Hs.301871 AA029046 3.9

123442 Hs.111496 AA598803 ESTs 3.9

127800 Hs.79428 AA521047 BCL2/adenovirus E1B 19kD-interacting pro 3.9

114555 Hs.167904 AA058594 ESTs 3.9

122138 Hs.163960 AA435549 3.9

129565 Hs.198726 X77777 vasoactive intestinal peptide receptor 1 3.9

103471 Hs.75216 Y00815 protein tyrosine phosphatase; receptor t 3.9

133908 Hs.325474 M83216 3.9

105635 Hs.301985 AA281508 ESTs 3.9

134285 Hs.81086 AA460012 solute carrier family 22 (organic cation 3.9

134125 Hs.50421 R38102 3.9

125628 Hs.241493 AA418069 natural killer-tumor recognition sequenc 3.9

103695 Hs.186600 AA018758 3.9

100642 Hs.182183 HG2743-HT3926 Caldesmon 1, Alt. Splice 6, Non-Muscle 3.9

104334 Hs.78771 D82614 ESTs 3.9

110242 Hs.19978 H26417 3.9

125298 Hs.289008 Z39255 3.9

104060 Hs.303193 AA397968 zt87a9.r1 Soares_testis_NHT Homo sapiens 3.9

105823 Hs.293960 AA398197 3.9

126499 Hs.110445 AA315671 ESTs; Moderately similar to unknown [M.m 3.9

130752 Hs.18895 D50927 KIAA0137 gene product 3.8

123494 Hs.112110 AA599786 3.8

104846 Hs.32478 AA040154 3.8

108921 Hs.71721 AA142913 3.8

115506 Hs.45207 AA292537 ESTs 3.8

100452 Hs.241552 D87742 Human mRNA for KIAA0268 gene; partial cd 3.8

104454 Hs.129228 M84443 3.8

108730 Hs.102859 AA126254 ESTs 3.8

131223 Hs.24427 AA247788 ESTs; Highly similar to (defline not ava 3.8

104784 Hs.269228 AA027055 ESTs 3.8

104946 Hs.73848 AA069549 ESTs 3.8

106932 Hs.9394 AA495926 ESTs 3.8

101724 Hs.620 M69225 bullous pemphigoid antigen 1 (230/240kD) 3.8

106140 Hs.14912 AA424524 Homo sapiens mRNA for KIAA0286 gene; par 3.8

128135 Hs.269721 AA913491 ESTs 3.8

120030 Hs.58694 W92051 ESTs 3.8

126457 Hs.50382 AA007489 zh98g04.r1 Soares_fetal_liver_spleen_1NF 3.8

123917 Hs.112969 AA621311 EST 3.7

110714 Hs.17752 H95978 Homo sapiens phosphatidylserine-specific 3.7

130577 Hs.162 M35410 Insulin-like growth factor binding prote 3.7
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117667 Hs.44708 N3S214 ser-Thr protein kinase related to the my 3.7 

126104 Hs.39712 N77278 ESTs; Weakly similar to BONE/CARTILAGE P 3.7 

100379 Hs.278721 D82060 Homo sapiens mRNA for membrane protein w 3.7 

115646 Hs.305971 AA404352 ESTs 3.7

125792 Hs.193700 AI005388 ESTs; Moderately similar to !!!! ALU SUB 3.7

102162 Hs.1592 U18291 CDC16 (cell division cycle 16; S.cerevi 3.7 

128530 Hs.183475 AA504343 ESTs; Moderately similar to !!!! ALU SUB 3.7

119940 Hs.272531 W86779 EST 3.7 

110769 Hs.23837 N22222 yw34b06.s1 Morton Fetal Cochlea Homo sap 3.7 

132914 Hs.60293 AA496037 ESTs 3.7

113594 Hs.15683 T92030 ESTs 3.7 

103702 Hs.279952 AA027793 ESTs; Highly similar to (defline not ava 3.7 

130730 Hs.19347 AA248406 ESTs 3.7 

123288 Hs.291025 AA495836 EST 3.7 

120691 Hs.22380 AA291173 ESTs 3.7

103153 Hs.75295 X66534 guanylate cyclase 1; soluble; alpha 3 3.7 

129201 Hs.109390 H19989 ESTs 3.7

114798 Hs.54900 AA159181 ESTs 3.7 

126801 Hs.7337 AA512902 ESTs 3.7 

105503 Hs.31707 AA256616 ESTs 3.7

104260 Hs.194283 AF008192 Homo sapiens putative GR6 protein (GR6) 3.7

125930 Hs.35699 R97219 ESTs 3.7 

123255 Hs.105273 AA490890 ESTs 3.6 

103862 Hs.6363 AA206625 ESTs 3.6 

100696 Hs.121686 HG3162-HT3339 Transcription Factor Iia 3.6

134917 Hs.166994 X87241 FAT tumor suppressor (Drosophila) homolo 3.6 

103520 Y10511 H.sapiens mRNA for CD176 protein 3.6 

113778 Hs.302738 W15263 ESTs 3.6 

101838 Hs.75511 M92934 connective tissue growth factor 3.6 

113702 T97307 ESTs; Moderately similar to !!!! ALU SUB 3.6

118201 Hs.48428 N59800 EST 3.6 

116519 Hs.68554 C20780 EST 3.6 

105866 Hs.22983 AA400517 ESTs; Moderately similar to UDP-GLUCOSE: 3.6 

106709 Hs.170291 AA464696 ESTs 3.6 

127858 Hs.27973 AA806365 oc26h07.s1 NCI_CGAP_GCB1 Homo sapiens cD 3.6

101964 S81578 dioxin-responsive gene {putative polyade 3.6 

105508 Hs.326416 AA256680 ESTs 3.6 

116844 Hs.337434 H64938 ESTs 3.6 

105372 Hs.142296 AA236481 ESTs 3.6 

100745 Hs.144630 HG3510-HT3704 V-Erba Related Ear-3 Protein 3.6

127521 Hs.164018 AA809982 ESTs 3.6 

110758 Hs.274265 N21365 talin 3.6 

107307 Hs.44155 T52099 creatine kinase; mitochondrial 2 (sarcom 3.6 

133200 Hs.183639 AA432248 ESTs 3.6 

114774 Hs.164325 AA150043 ESTs 3.6

120265 Hs.270696 AA173759 ESTs; Moderately similar to !!!! ALU SUB 3.6 

134359 Hs.199067 M34309 v-erb-b2 avian erythroblastic leukemia v 3.6 

116250 Hs.44829 AA480975 ESTs; Moderately similar to !!!! ALU SUB 3.6

106313 Hs.35841 AA436459 nuclear factor I/X (CCAAT-binding transc 3.6

131898 Hs.279780 N52232 ESTs 3.6

133444 Hs.73793 M27281 vascular endothelial growth factor 3.6 

128232 Hs.334641 H06296 ESTs 3.6 

135357 Hs.79572 AA235803 ESTs 3.5

457951 AI369384 arylsulfatase D 3.5 

108407 AA075519 zm87h9.s1 Stratagene ovarian cancer (#93 3.5

126559 T16245 a disintegrin and metalloproteinase doma 3.5

104189 Hs.301804 AA485805 ESTs 3.5 

125956 Hs.129014 N53276 ESTs 3.5 

103026 Hs.79386 X54162 Human mRNA for a 64 Kd autoantigen expre 3.5

133011 Hs.171921 AA042990 sema domain; immunoglobulin domain (Ig); 3.5

131379 Hs.26176 R49035 ESTs 3.5 

126742 Hs.169359 H64106 yr57e06.r1 Soares fetal liver spleen 1NF 3.5

105560 Hs.306915 AA262783 ESTs 3.5 

118472 Hs.42179 N66818 ESTs 3.5 

105623 Hs.30127 AA280895 ESTs; Highly similar to !!!! ALU SUBFAMI 3.5

120262 Hs.145807 AA172076 ESTs; Moderately similar to !!!! ALU SUB 3.5

105027 Hs.26771 AA126472 ESTs 3.5 

130760 Hs.18953 AA128997 phosphodiesterase 9A 3.5 

117473 Hs.155560 N30157 ESTs 3.5 
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102663 Hs.168075 U70322 karyopherin (importin) beta 2 3.5 

126349 Hs.13531 AA442868 ESTs; Weakly similar to (defline not ava 3.5

132154 Hs.41119 N67179 ESTs 3.5 

131689 Hs.30695 AA599653 transcription factor-like 5 (basic helix 3.5 

127862 Hs.163191 AA765305 EST 3.5

126995 Hs.189810 W26950 Human DNA sequence from PAC388M5 on chr 3.5 

119071 R31180 ESTs 3.5 

103941 Hs.96593 AA282973 ESTs 3.5 

110721 Hs.31319 H97678 ESTs 3.5 

126586 Hs.43086 AA011247 ESTs 3.5

103106 Hs.1857 X62025 phosphodiesterase 6G; cGMP-specific; rod 3.5 

116357 Hs.90797 AA504806 Homo sapiens clone 23620 mRNA sequence 3.5 

105309 Hs.4104 AA233790 ESTs 3.5 

130796 Hs.19525 R39390 ESTs 3.5 

109101 Hs.52184 AA167708 ESTs 3.5

103134 Hs.2839 X65724 Norrie disease (pseudoglioma) 3.5 

131798 Hs.301449 X86098 adenovirus 5 E1A binding protein 3.5

118535 Hs.49418 N67968 ESTs 3.5 

102592 Hs.11223 U62389 Human putative cytosolic NADP-dependent 3.4 

125905 Hs.6456 T69868 chaperonin containing TCP1; subunit 2 (b 3.4

109160 Hs.301997 AA179387 ESTs 3.4 

105327 Hs.211593 AA234440 ESTs 3.4 

106586 Hs.57787 AA456598 ESTs 3.4 

122635 AA454085 EST 3.4 

132413 Hs.260116 AA132969 metalloprotease 1 (pitrilysin family) 3.4

131938 Hs.34956 AA283620 ESTs 3.4 

133871 Hs.182793 AA454597 ESTs 3.4 

107175 Hs.292503 AA621751 ESTs; Weakly similar to KIAA0601 protein 3.4

101188 Hs.184298 L20320 cyclin-dependent kinase 7 (homolog of Xe 3.4

126422 Hs.237658 H48518 ESTs; Highly similar to apolipoprotein A 3.4

118475 N66845 ESTs; Weakly similar to !!!! ALU CLASS B 3.4 

104558 Hs.88959 R56678 ESTs; Weakly similar to !!!! ALU SUBFAMI 3.4 

128307 Hs.132005 AI453794 ESTs 3.4 

112254 Hs.25829 R51831 ESTs 3.4 

125408 Hs.89578 N72353 yv37e12.r1 Soares fetal liver spleen 1NF 3.4

109834 Hs.175955 H00604 ESTs 3.4 

130844 Hs.20191 D12122 seven in absentia (Drosophila) homolog 2 3.4 

127143 Hs.20843 AA533553 nj68h04.s1 NCI_CGAP_Pr10 Homo sapiens cD 3.4 

135309 Hs.42500 D25984 ESTs 3.4 

125724 Hs.295978 AA083407 stimulated trans-acting factor (50 kDa) 3.4

127692 Hs.187983 AI021912 ESTs 3.4 

116674 Hs.92127 F04816 ESTs 3.4 

134700 Hs.8868 AA481414 golgi SNAP receptor complex member 1 3.4

114846 Hs.166196 AA234929 ESTs 3.4 

103649 Hs.155983 Z70219 H.sapiens mRNA for 5'UTR for unknown pro 3.4

134835 Hs.89925 L04569 calcium channel; voltage-dependent; L ty 3.4

130568 Hs.16085 AA232535 ESTs; Highly similar to (defline not ava 3.4 

111331 Hs.15978 N78773 ESTs 3.4 

106036 Hs.10653 AA412505 ESTs 3.4 

130987 Hs.21893 R45698 ESTs 3.4

112814 Hs.35828 R98192 ESTs 3.4 

127815 Hs.255015 AA876009 ob93c10.s1 NCI_CGAP_GCB1 Homo sapiens cD 3.4

100144 Hs.75616 D13643 KIAA0018 gene product 3.4

101129 Hs.247992 L10405 Homo sapiens DNA binding protein for sur 3.4 

130874 Hs.20621 T08287 ESTs 3.4

106882 Hs.56994 AA489009 ESTs 3.4

103855 Hs.302267 AA195179 ESTs 3.4 

125957 H45213 yo03b08.r1 Soares adult brain N2b5HB55Y 3.3 

114048 Hs.146085 W94613 ESTs 3.3 

109826 Hs.75354 F13702 ESTs 3.3

125355 Hs.170098 R45630 ESTs; Highly similar to KIAA0372 [H.sapi 3.3 

104182 Hs.143792 AA479990 ESTs; Weakly similar to glioma amplified 3.3 

100294 Hs.75454 D49396 Human mRNA for Apo1_Human (MER5(Aop1-Mou 3.3 

131688 Hs.30692 U24153 p21 (CDKN1A)-activated kinase 2 3.3 

116255 Hs.88201 AA481256 ESTs; Weakly similar to (defline not ava 3.3

102034 Hs.230 U05291 fibromodulin 3.3

130072 Hs.14658 R99606 Human chromosome 5q13.1 clone 5G8 mRNA 3.3

114615 Hs.159456 AA083812 ESTs; Highly similar to (defline not ava 3.3 

128707 Hs.104105 AA136474 Meis (mouse) homolog 2 3.3 
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115048 Hs.1 90057 AA252668 ESTs 3.3 

125862 Hs.31110 H12084 ESTs 3.3

135142 Hs.24192 R31679 ESTs 3.3 

103119 Hs.2877 X63829 cadherin 3; P-cadherin (placental) 3.3

104460 Hs.62604 M91504 ESTs 3.3

100365 Hs.79284 D78611 mesoderm specific transcript (mouse) hom 3.3

131524 Hs.301804 N39152 ESTs 3.3 

102165 Hs.159627 U18321 Death associated protein 3 3.3 

126966 Hs.182575 R38438 solute carrier family 15 (H+/peptide tra 3.3

124839 Hs.140942 R55784 ESTs 3.3

100709 Hs.100469 HG3264-HT3441 Af-6 (Gb:U02478) 3.3

132967 Hs.61635 AA032221 Homo sapiens BAC clone RG041D11 from 7q2 3.3 

102927 Hs.65114 X12876 keratin 18 3.3 

132616 Hs.283558 AA386264 ESTs 3.3 

125132 Hs.129781 W15495 ESTs 3.3

111225 Hs.31652 N68989 ESTs 3.3 

114956 Hs.87113 AA243681 ESTs 3.3 

122235 Hs.112227 AA436475 ESTs 3.3

112325 Hs.12315 R56055 ESTs 3.3 

123360 Hs.178604 AA504784 ESTs 3.3

105150 Hs.155995 AA169540 Homo sapiens mRNA for KIAA0643 protein; 3.3

107391 Hs.284294 W02877 ESTs 3.3 

113058 Hs.7569 T26893 EST 3.3 

134371 Hs.82318 S69790 Bmsh-1 3.3 

125669 Hs.333256 R51308 ESTs; Moderately similar to !!!! ALU SUB 3.3

111506 Hs.294105 R07726 ESTs 3.3 

122974 Hs.194215 AA478625 ESTs 3.3

102369 Hs.299867 U39640 hepatocyte nuclear factor 3; alpha 3.3 

120408 Hs.190151 AA235045 ESTs 3.3 

117993 Hs.47402 N52039 ESTs; Weakly similar to !!!! ALU SUBFAMI 3.3

129586 Hs.11500 AA437118 ESTs 3.3

128138 Hs.126494 AI200825 ESTs 3.3 

127265 AA332751 EST37214 Embryo, 8 week I Homo sapiens c 3.3 

107674 Hs.41143 AA011027 Homo sapiens mRNA for KIAA0581 protein; 3.2

104866 Hs.293691 AA045342 ESTs 3.2

103427 Hs.250655 X97303 H.sapiens mRNA for Ptg-12 protein 3.2 

132990 Hs.334334 AA458761 ESTs 3.2

127017 Hs.251946 AA740146 ESTs 3.2 

132313 Hs.44481 U13220 forkhead (Drosophila)-like 6 3.2

106880 Hs.32425 AA488889 ESTs 3.2

107039 Hs.169780 AA599751 homologous to yeast nitrogen permease (o 3.2

120870 Hs.292581 AA357172 ESTs 3.2

107920 Hs.284207 AA027951 ESTs 3.2 

104165 Hs.105116 AA459160 EST 3.2

107012 Hs.63908 AA598745 ESTs 3.2

103605 Hs.194657 Z35402 H.sapiens gene encoding E-cadherin, exon 3.2

124006 Hs.270016 D60302 ESTs 3.2 

101300 Hs.74137 L40391 Homo sapiens (clone s153) mRNA fragment 3.2 

101183 Hs.795 L19779 H2A histone family; member O 3.2

125596 R25693 yg44h11.r2 Soares infant brain 1NIB Homo 3.2

127261 AA661567 nu86b02.s1 NCI_CGAP_Alv1 Homo sapiens cD 3.2 

120090 Hs.59554 W94591 ESTs 3.2 

129393 Hs.166982 D13435 phosphatidylinositol glycan; class F 3.2

120923 Hs.97129 AA382283 ESTs 3.2 

118907 Hs.274256 N91003 ESTs 3.2

111552 Hs.191185 R09411 ESTs 3.2 

104431 Hs.99913 J03019 adrenergic; beta-1-; receptor 3.2 

133551 Hs.278634 D63480 Human mRNA for KIAA0146 gene; partial cd 3.2

131615 Hs.192803 D14533 xeroderma pigmentosum; complementation g 3.2 

126547 Hs.84072 U47732 transmembrane 4 superfamily member 3 3.2

103172 Hs.116774 X68742 integrin; alpha 1 3.2

113867 Hs.24095 W68845 ESTs 3.2

133323 Hs.70937 Z83735 H3 histone family; member K 3.2 

111597 Hs.189716 R11499 ESTs 3.2 

121515 Hs.104696 AA412133 ESTs 3.2

107445 Hs.6639 W28406 ESTs 3.2

106887 Hs.334335 AA489091 ESTs 3.2

123052 Hs.185766 AA481806 ESTs 3.2

107072 Hs.130760 AA609113 Homo sapiens mRNA; cDNA DKFZp586N0318 (f 3.2
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102214 Hs.32854 U23752 SRY (sex-determining region Y)-box 11 3.2 

123147 AA487961 ab11h6.s1 Stratagene lung (#93721) Homo 3.2

125435 Hs.272138 R00940 ye87g03.r1 Soares fetal liver spleen 1NF 3.2

116246 Hs.250646 AA479961 ESTs; Highly similar to ubiquitin-conjug 3.2

105169 Hs.180789 AA180321 Homo sapiens (clone S164) mRNA; 3' end o 3.2

134001 Hs.78344 AF001548 myosin; heavy polypeptide 11; smooth mus 3.2 

124866 Hs.304389 R68571 ESTs 3.2 

133205 Hs.67619 AA089559 Homo sapiens mRNA; chromosome 1 specific 3.2 

102986 Hs.182378 X17648 colony stimulating factor 1 (macrophage) 3.2 

101232 Hs.242894 L28997 ADP-ribosylation factor-like 1 3.1

132906 Hs.234896 AA142857 ESTs; Highly similar to geminin [H.sapie 3.1 

104281 Hs.5669 C14290 ESTs 3.1 

123926 Hs.227933 AA621348 ESTs; Highly similar to (defline not ava 3.1

134464 Hs.239720 N79354 ESTs; Weakly similar to Rga [D.melanogas 3.1 

105322 Hs.16346 AA234100 ESTs 3.1

100631 Hs.48332 HG2709-HT2805 Serine/Threonine Kinase (Gb:Z25431) 3.1

130791 Hs.199263 AA259102 ESTs; Highly similar to (defline not ava 3.1 

131220 Hs.300355 R77200 ESTs 3.1 

113237 Hs.123642 T62857 ESTs 3.1 

125562 Hs.98968 AI494372 ESTs 3.1

134110 Hs.79136 U41060 Human breast cancer; estrogen regulated 3.1 

132393 Hs.47334 W85888 ESTs; Moderately similar to !!!! ALU SUB 3.1 

107439 Hs.296842 W27995 ESTs; Moderately similar to non-muscle m 3.1

125863 Hs.40719 AA299096 Homo sapiens mRNA; cDNA DKFZp564M0916 (f 3.1

105811 Hs.286192 AA394121 ESTs 3.1

129284 Hs.296141 AA104023 ESTs 3.1 

125321 Hs.178294 T86652 ESTs 3.1 

107332 Hs.183297 T87750 ESTs 3.1 

123570 Hs.109653 AA608955 ESTs 3.1

100384 Hs.90800 D83646 matrix metalloproteinase 16 (membrane-in 3.1

109063 Hs.38972 AA161043 tetraspan 1 3.1 

133284 Hs.182828 U09367 zinc finger protein 136 (clone pHZ-20) 3.1 

131839 Hs.33010 H80622 Homo sapiens mRNA for KIAA0633 protein; 3.1 

117606 Hs.44698 N35115 ESTs 3.1 

418998 Hs.287849 F13215 ESTs 3.1

125180 Hs.103120 W58344 ESTs 3.1 

100789 HG3893-HT4163 Phosphoglucomutase 1, Alt. Splice 3.1 

126017 Hs.159440 H60487 ESTs 3.1 

132452 Hs.247324 AA005262 Homo sapiens DNA sequence from PAC 262D1 3.1 

129077 Hs.108479 H78886 ESTs 3.1

126563 Hs.181368 W26247 U5 snRNP-specific protein (220 kD); orth 3.1 

129650 Hs.118258 N52654 ESTs 3.1

123465 AA599033 ESTs 3.1 

126486 Hs.152316 AA345339 EST51345 Gall bladder II Homo sapiens cD 3.1

126460 Hs.167031 W01616 za36d05.r1 Soares fetal liver spleen 1NF 3.1

118697 Hs.43234 N72094 ESTs 3.1 

103860 Hs.38057 AA203742 ESTs 3.1 

127958 Hs.124347 AA971439 ESTs 3.1

124984 Hs.223241 T47566 yb15c11.s1 Stratagene placenta (#937225) 3.1 

103903 Hs.15220 AA249334 j312.seq.F Human fetal heart, Lambda ZAP 3.1

106697 Hs.22242 AA463737 ESTs 3.1 

130892 Hs.20993 AA442604 ESTs; Weakly similar to Ydr374cp [S.cere 3 

114032 Hs.35014 W92779 ESTs 3

128835 Hs.106390 W15528 ESTs 3 

103667 Hs.247815 Z80788 H.sapiens H4/I gene 3

126264 Hs.250614 N42897 yy13h06.r1 Soares melanocyte 2NbHM Homo 3 

132626 Hs.21275 D25755 ESTs 3

131107 Hs.75354 N87590 ESTs 3 

126780 Hs.5811 R12421 ESTs 3 

127363 Hs.22116 AA307744 Homo sapiens Cdc14B1 phosphatase mRNA; c 3

103690 Hs.84063 AA016186 ESTs 3 

102589 Hs.8867 U62015 Homo sapiens Cyr61 mRNA, complete cds 3

125144 Hs.24336 W37999 ESTs 3

132977 Hs.301404 U28686 RNA binding motif protein 3 3 

120714 Hs.146170 AA292689 ESTs 3

101038 Hs.79411 J05249 replication protein A2 (32kD) 3 

102856 Hs.248177 X00090 Human histone H3 gene 3 

105516 Hs.30738 AA257971 ESTs 3 

131137 Hs.33287 U85193 nuclear factor I/B 3
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113801 H 

133780 H 

104690 F 

126371 h 

127635 H 

128434 F 

435761 h 



127402 Hs.227949 AA358869 

117483 Hs.44189 N30426 

123659 Hs.112699 AA609368

103963 Hs.63290 AA298588 

103795 Hs.7367 AA112222

115092 Hs.80975 AA255903 

134831 Hs.89890 S72370 

128579 Hs.101810 AA093378 

134193 Hs.7980 F09570 

123522 Hs.112575 AA608577

107109 Hs.32793 AA609943 



1 AI354332 ESTs 
R26708 ESTs 
U26174 granzyme K (serine protease; granzyme 3;

HG2705-HT2602 S ' ~ ' " 



heterogeneous nuclear ribonucleoprotein 
proteasome (prosome; macropain) subunit; 
Homo sapiens chromosome 2; 10 repeat reg 
ESTs 

ESTs; Moderately similar to !!!! ALU SUB 
Homo sapiens mRNA; cDNA DKFZp586l1518 (f 
thymus specific serine peptidase
ESTs 

ESTs; Highly similar to SEC13-RELATED PR 

ESTs 

ESTs 

EST114219 HSC172 cells II Homo sapiens c 
ESTs; Moderately similar to (defline not 
CD39-like 4 
pyruvate carboxylase 

ESTs; Weakly similar to !!!! ALU SUBFAMI 

ESTs 

ESTs 

ESTs 

histone deacetylase 1 
tumor rejection antigen (gp96) 1 
H. sapiens RNA for CLCN3 
ESTs 

zn13e12.s1 Stratagene hNT neuron (#93723 
Nucleic Acid-Binding Protein (Gb:L12693) 
ESTs; Weakly similar to weakly similar t 
replication factor C (activator 1) 2 (40
ESTs 



in kin 



114546 
105914 
108552 
126505 
134098 
129721 
100076 
117466 
106335 
134510 



Hs.211539 
Hs.277422 
Hs.44104 
Hs.36688 



stromal cell-derived factor 1 
ESTs; Weakly similar to (defline not ava 
peroxisomal biogenesis factor 6 
ESTs; Moderately similar to !!!! ALU SUB
Homo sapiens growth arrest and DNA-damag 
zn11c7.s1 Stratagene hNT neuron (#937233 
16a11 Human retina cDNA randomly primed 
Human MRL3 mRNA for ribosomal protein L3 
eukaryotic translation initiation factor 
Homo sapiens mRNA for cadherin FIB3, par 



ESTs; Moderately similar to WAP four-dis 
Hs.250870 U25265 protein kinase; mitogen-activated; kinas 

Hs.32995 AA398412 ESTs 

Hs.26267 AA458904 ESTs; Weakly similar to torsinA [H.sapie 
Hs.173824 U51166 thymine-DNA glycosylase
Hs.182183 HG2743-HT2846 Caldesmon 1, Alt. Splice 4, Non-Muscle
ESTs 
ESTs 

BH-protocadherin (brain-heart) 
methylmalonyl Coenzyme A mutase 
EST* 



H.sapiens mRNA for ArgBPIB protein 



3 Hs.42738 
9 Hs.34073 
7 Hs.155212 
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131710 Hs.30985 AA233225 ESTs; Highly similar to (defline not ava 2.9 

125231 Hs.268903 W84714 ESTs 2.9

127380 Hs.15535 AI417137 Homo sapiens clone 24582 mRNA sequence 2.9 

104229 Hs.61289 AB002346 inositol phosphate 5'-phosphatase 2 (syn 2.9 

126600 Hs.191385 AA699949 ESTs 2.9

125175 Hs.303030 W52355 EST 2.9 

103849 Hs.34578 AA187045 ESTs; Weakly similar to !!!! ALU SUBFAMI 2.9

102126 Hs.78961 U14575 protein phosphatase 1; regulatory (inhib 2.9

124906 Hs.107815 R87647 ESTs 2.9 

131148 Hs.303125 C00038 ESTs 2.9

123158 Hs.218329 AA488658 heat shock 70kD protein 1 2.9

133667 Hs.75462 U72649 Human BTG2 (BTG2) mRNA; complete cds 2.9 

105182 Hs.18271 AA191014 ESTs; Weakly similar to Ydr372cp [S.cere 2.9 

133968 Hs.232068 D15050 Human mRNA for transcription factor AREB 2.9

117425 Hs.336901 N27154 ESTs 2.9

111087 Hs.37637 N59645 ESTs 2.9 

129641 Hs.11805 N66066 ESTs 2.9 

128639 Hs.102897 N91246 ESTs 2.9 

133209 Hs.79265 AA114183 ESTs; Moderately similar to glutamate py 2.9 

135154 Hs.267812 AA126433 sorting nexin 4 2.9

126838 Hs.279609 AA858097 pigment epithelium-derived factor 2.9 

103803 Hs.106149 AA127696 ESTs 2.9 

102139 Hs.2128 U15932 dual specificity phosphatase 5 2.9 

128104 AA971000 op67g11.s1 Soares_NFL_T_GBC_S1 Homo sapi 2.8

127834 Hs.337631 AA761415 nz22d05.s1 NCI_CGAP_GCB1 Homo sapiens cD 2.8

133101 Hs.180952 AA488230 ESTs 2.8 

127250 Hs.217916 AI023717 ESTs 2.8 

135063 Hs.93883 D10537 myelin protein zero (Charcot-Marie-Tooth 2.8 

126323 Hs.68644 N45014 yy80g06.r1 Soares_multiple_sclerosis_2Nb 2.8

121873 Hs.145696 AA426270 ESTs 2.8

122090 Hs.98684 AA432141 ESTs 2.8 

118728 Hs.322645 N73705 ESTs 2.8 

135400 Hs.99915 M23263 androgen receptor (dihydrotestosterone r 2.8 

125278 Hs.129998 W93523 ESTs 2.8 

124337 Hs.109019 N27637 ESTs 2.8 

124803 Hs.12186 R45480 cyclin K 2.8 

H45968 Hs.32149 H45968 ESTs 2.8 

104261 Hs.5409 AF008442 RNA polymerase I subunit 2.8 

105336 Hs.282093 AA236356 ESTs 2.8 

106070 Hs.5957 AA417761 Homo sapiens clone 24416 mRNA sequence 2.8 

131356 Hs.25960 M13241 v-myc avian myelocytomatosis viral relat 2.8 

112009 Hs.26255 R42714 EST 2.8 

133199 Hs.250175 AA609773 Homo sapiens clone 23904 mRNA sequence 2.8 

110379 Hs.33130 H44825 ESTs 2.8 

103890 Hs.72085 AA236343 ESTs; Weakly similar to unknown [S.cerev 2.8 

128152 R20353 yg20f10.r1 Soares infant brain 1NIB Homo 2.8 

107008 Hs.23740 AA598710 ESTs 2.8 

135243 Hs.97101 AA215333 ESTs 2.8 

103058 Hs.184510 X57348 stratifin 2.8 

132020 Hs.293845 AA428990 ESTs 2.8 

116354 Hs.292566 AA504262 ESTs 2.8 

125867 Hs.12372 H98141 ESTs 2.8 

120603 Hs.98541 AA282787 ESTs; Highly similar to (defline not ava 2.8 

115119 Hs.46847 AA256524 Human DNA sequence from clone 30M3 on ch 2.8 

133865 Hs.170290 F09315 discs; large (Drosophila) homolog 5 2.8 

109415 Hs.110826 AA227219 Homo sapiens CAGF9 mRNA; partial cds 2.8 

128687 Hs.23767 Z38910 ESTs 2.8 

109984 Hs.10299 H09594 ESTs; Moderately similar to !!!! ALU SUB 2.8 

133179 Hs.66731 U81599 homeobox B13 2.8 

115998 Hs.336629 AA448488 ESTs; Weakly similar to zinc finger prot 2.8 

112180 Hs.25067 R49116 EST 2.8 

120428 Hs.173694 AA236822 ESTs; Moderately similar to (defline not 2.8 

106241 Hs.6019 AA430108 ESTs 2.8 

131060 Hs.22564 AA160890 myosin VI 2.8 

111383 Hs.40919 N94527 ESTs 2.8 

102123 Hs.1594 U14518 centromere protein A (17kD) 2.8 

102722 Hs.79981 U79242 Human clone 23560 mRNA sequence 2.8 

129887 Hs.274324 W92041 PCAF associated factor 65 alpha 2.8 

126563 Hs.181297 AA714635 ESTs 2.8 
104367 Hs.134342 H17438 ESTs; Weakly similar to seven transmembra 2.8 

107316 Hs.193700 T63174 ESTs; Moderately similar to !!!! ALU SUB 2.8 

128059 Hs.145096 AA972446 ESTs 2.8 

124447 N48000 ESTs 2.8 

111398 Hs.125565 R00086 deafness; X-linked 1; progressive 2.8 

134085 Hs.79018 U20979 chromatin assembly factor I (150 kDa) 2.8 

124788 Hs.100912 R43543 ESTs 2.8 

112248 Hs.326416 R51361 ESTs 2.8 

121309 Hs.97312 AA402482 ESTs 2.8 

103076 Hs.75319 X59618 ribonucleotide reductase M2 polypeptide 2.8 

107071 Hs.35198 AA609053 ESTs 2.8 

104425 Hs.35380 H88498 ESTs 2.8 

132991 Hs.62245 AA446906 solute carrier family 25 (mitochondrial 2.8 

104968 Hs.29669 AA084602 ESTs 2.8 

121153 Hs.97694 AA399640 ESTs 2.8 

131216 Hs.243901 D31058 ESTs 2.8 

109682 Hs.22869 F09299 ESTs 2.8 

131990 Hs.168818 H77734 ESTs; Moderately similar to roundabout 1 2.8 

132027 Hs.181444 N78844 ESTs; Weakly similar to R12C12.6 [C.eleg 2.8 

127383 Hs.190478 AA447990 ESTs 2.8 

132598 Hs.530 M81379 collagen; type IV; alpha 3 (Goodpasture 2.8 

101121 Hs.1313 L09753 tumor necrosis factor (ligand) superfami 2.8 

123000 Hs.105640 AA479347 ESTs 2.8 

121329 Hs.1755 AA404324 ESTs 2.8 

100481 Hs.121489 HG1098-HT1098 Cystatin D 2.7 

113803 Hs.283683 W42789 ESTs 2.7 

110934 Hs.169001 N48708 ESTs; Weakly similar to cytochrome P-450 2.7 

432888 T86823 ESTs 2.7 

121802 Hs.188898 AA424328 ESTs 2.7 

130396 Hs.155313 AB002331 Human mRNA for KIAA0333 gene; partial cd 2.7 

121103 Hs.97697 AA398936 ESTs; Weakly similar to (defline not ava 2.7 

131129 Hs.23240 R27296 ESTs 2.7 

130943 Hs.272429 D50855 calcium-sensing receptor (hypocalciuric 2.7 

134676 Hs.87819 W28051 ESTs; Weakly similar to keratin 9; cytos 2.7 

111900 Hs.25318 R39044 ESTs 2.7 

106025 Hs.173334 AA412063 ESTs 2.7 

126144 Hs.40639 N39696 yx92a07.r1 Soares melanocyte 2NbHM Homo 2.7 

103248 Hs.75262 X77383 cathepsin O 2.7 

127230 Hs.274170 H30501 Homo sapiens Opa-interacting protein OIP 2.7 

101584 Hs.84072 M35252 transmembrane 4 superfamily member 3 2.7 

124131 Hs.167489 H15980 ESTs 2.7 

129689 Hs.77873 AA130156 ESTs 2.7 

132892 Hs.9973 W92797 ESTs 2.7 

120827 Hs.132967 AA347717 ESTs 2.7 

134579 Hs.85963 N23222 ESTs; Moderately similar to !!!! ALU SUB 2.7 

106149 Hs.256301 AA424881 ESTs 2.7 

132037 Hs.332541 AA203649 ESTs; Weakly similar to HEM45 [H.sapiens 2.7 

130542 Hs.179825 U64675 Human sperm membrane protein BS-63 mRNA, 2.7 

122851 Hs.99598 AA463627 ESTs 2.7 

134983 Hs.196384 D28235 prostaglandin-endoperoxide synthase 2 (p 2.7 

120537 Hs.160422 AA262790 ESTs 2.7 

131036 Hs.174140 X64330 ATP citrate lyase 2.7 

133889 Hs.211582 AA099391 ESTs 2.7 

128847 Hs.106529 AA424199 w81e01.r1 Soares_total_fetus_Nb2HF8_9w 2.7 

112755 Hs.306044 R93802 ESTs 2.7 

423239 AA323591 EST26392 Cerebellum II Homo sapiens cDNA 2.7 

105031 Hs.12321 AA127240 ESTs 2.7 

126021 Hs.187516 AA775894 ESTs 2.7 

102116 U13706 Human ELAV-like neuronal protein 1 isofo 2.7 

133394 Hs.237225 R16759 ESTs; Weakly similar to (defline not ava 2.7 

104267 Hs.278439 C00358 ESTs 2.7 

107614 Hs.40241 AA004878 ESTs; Highly similar to (defline not ava 2.7 

129809 Hs.1259 X55283 asialoglycoprotein receptor 2 2.7 

112109 Hs.283309 R45221 ESTs; Weakly similar to !!!! ALU SUBFAMI 2.7 

128422 T85681 yd60c06.r1 Soares fetal liver spleen 1NF 2.7 

109494 Hs.43899 AA233702 ESTs 2.7 

118696 Hs.292284 N72086 Homo sapiens RNA polymerase III largest 2.7 

106053 Hs.36727 AA416963 ESTs; Highly similar to histone H2A [H.S 2.7 

104440 Hs.284380 L20492 gamma-glutamyltransferase 1 2.7 
129426 Hs.111323 AA412087 EST; Highly similar to (defline not avai 2.7 

123798 AA620411 small inducible cytokine A5 (RANTES) 2.7 

106716 Hs.238928 AA464962 ESTs 2.7 

103663 Z78291 Z78291 Homo sapiens brain fetus Homo sap 2.7 

114162 Hs.22265 Z38909 ESTs 2.7 

113063 Hs.5027 T32438 ESTs 2.7 

127897 AA773857 af80c09.r1 Soares_NhHMPu_S1 Homo sapiens 2.7 

130621 Hs.16803 AA621718 ESTs; Weakly similar to (defline not ava 2.7 

116245 Hs.42796 AA479958 ESTs; Highly similar to (defline not ava 2.7 

125499 R11878 yf49d11.r1 Soares infant brain 1NIB Homo 2.7 

133960 Hs.77899 M19267 tropomyosin 1 (alpha) 2.7 

104470 Hs.246358 N28843 ESTs; Weakly similar to Similar to colla 2.7 

134982 Hs.92308 N46086 ESTs 2.7 

106803 Hs.284295 AA479114 ESTs 2.7 

104899 Hs.285574 AA054726 ESTs 2.7 

125401 Hs.337585 AI204637 ESTs; Moderately similar to KIAA0350 [H. 2.7 

111253 Hs.15768 N70042 ESTs; Moderately similar to !!!! ALU SUB 2.7 

118449 Hs.164478 N66413 ESTs; Weakly similar to (defline not ava 2.7 

134507 Hs.84318 M63488 replication protein A1 (70kD) 2.7 

121609 Hs.98185 AA416867 EST 2.7 

113835 Hs.27475 W56590 ESTs 2.7 

113962 Hs.285290 W86375 ESTs; Highly similar to (defline not ava 2.7 

121913 Hs.98558 AA428062 ESTs 2.7 

108194 Hs.216717 AA057250 ESTs 2.7 

130799 Hs.12696 AA464273 ESTs 2.7 

123184 Hs.18166 AA489072 Homo sapiens mRNA for KIAA0870 protein; 2.7 

103420 Hs.173497 X97065 SEC23-like protein B 2.7 

106186 Hs.6315 AA427398 acetylserotonin N-methyltransferase-like 2.7 

101349 L77559 Homo sapiens DGS-B partial mRNA 2.7 

112954 Hs.6655 T16559 ESTs 2.7 

133054 Hs.291079 R07876 ESTs; Weakly similar to unknown [S.cerev 2.7 

128131 Hs.25640 AI283162 claudin 3 2.6 

101864 Hs.75777 M95787 transgelin 2.6 

111948 Hs.26303 R40752 ESTs 2.6 

130145 Hs.151051 U07620 protein kinase mitogen-activated 10 (MAP 2.6 

126507 Hs.23964 AI362218 ESTs 2.6 

117903 Hs.47111 N50740 ESTs 2.6 

116345 Hs.199067 AA496981 ESTs 2.6 

132227 Hs.4248 AA412620 ESTs 2.6 

125746 Hs.274256 H03574 yj42b06.t1 Soares placenta Nb2HP Homo sa 2.6 

105073 Hs.89463 AA137034 ESTs 2.6 

102764 U82310 Homo sapiens unknown protein mRNA, parti 2.6 

131367 Hs.173933 AA456687 ESTs 2.6 

130792 Hs.19500 AA307896 nuclear localization signal deleted in v 2.6 

107427 Hs.46736 W26975 ESTs 2.6 

117477 Hs.44175 N30328 ESTs 2.6 

106290 Hs.16364 AA435542 ESTs 2.6 

126829 Hs.7910 R11547 ESTs 2.6 

118836 Hs.173001 N79820 ESTs 2.6 

100147 Hs.136348 D13666 osteoblast specific factor 2 (fasciclin 2.6 

104278 Hs.109253 C02582 ESTs; Highly similar to (defline not ava 2.6 

135051 Hs.83484 C15324 ESTs 2.6 

126081 Hs.227835 AI346024 collagen; type I; alpha 1 2.6 

123579 AA608983 af5d4.s1 Soares_testis_NHT Homo sapiens 2.6 

130115 Hs.149923 M31627 X-box binding protein 1 2.6 

101434 Hs.1430 M20218 coagulation factor XI (plasma thrombopla 2.6 

122962 Hs.104720 AA478429 ESTs; Moderately similar to !!!! ALU SUB 2.6 

126151 Hs.40808 AA324743 ESTs 2.6 

128925 Hs.21851 D61676 Homo sapiens mRNA; cDNA DKFZp586J2118 (f 2.6 

128919 Hs.103391 L27559 insulin-like growth factor binding prote 2.6 

130296 Hs.154103 R09286 LIM protein (similar to rat protein kina 2.6 

128402 Hs.191637 AA457244 ESTs 2.6 

129273 Hs.109968 W63783 ESTs 2.6 

125433 Hs.7788 F07759 ESTs 2.6 

132953 Hs.321264 AA029927 ESTs 2.6 

130963 Hs.21639 U57099 nuclear protein; marker for differential 2.6 

120614 Hs.194154 AA284281 ESTs; Weakly similar to !!!! ALU SUBFAMI 2.6 

123251 Hs.103267 AA490858 ESTs; Moderately similar to Rabin3 [R.no 2.6 

121710 Hs.96744 AA419011 ESTs 2.6 
125428 Hs.851 W74608 ESTs; Highly similar to (defline not ava 2.6 

115906 Hs.82302 AA435616 ESTs 2.6 

108432 AA076626 Homo sapiens clone 23851 mRNA sequence 2.6 

126191 Hs.191911 H97728 ESTs 2.6 

106164 Hs.281434 AA425773 ESTs 2.6 

111519 Hs.268615 R08165 ESTs 2.6 

134590 Hs.173840 W58612 ESTs 2.6 

102565 U59748 Human desert hedgehog (hDHH) mRNA, parti 2.6 

129879 Hs.13109 AA194973 ESTs 2.6 

114264 Hs.334609 Z40074 ESTs 2.6 

106236 Hs.21104 AA429951 ESTs 2.6 

135192 Hs.321709 AF000234 purinergic receptor P2X; ligand-gated io 2.6 

109833 Hs.29889 H00580 ESTs 2.6 

105756 Hs.8535 AA303088 ESTs; Weakly similar to transformation-r 2.6 

121422 Hs.97967 AA406210 ESTs 2.6 

130417 Hs.155485 U58522 Human huntingtin interacting protein (HI 2.6 

124312 Hs.102329 H94647 ESTs 2.6 

108996 Hs.97199 AA156058 ESTs 2.6 

127081 Hs.180591 R88362 ESTs; Weakly similar to weak similarity 2.6 

129574 Hs.11463 AA458603 ESTs; Weakly similar to (defline not ava 2.6 

112410 Hs.26904 R61680 ESTs 2.6 

123929 Hs.112981 AA621364 ESTs 2.6 

122905 Hs.104835 AA470070 ESTs 2.6 

116399 Hs.110637 AA599729 Homo sapiens homeobox protein A10 (HOXA1 2.6 

130279 Hs.153934 AA424044 core-binding factor; runt domain; alpha 2.6 

130021 Hs.1435 M24470 guanosine monophosphate reductase 2.6 

100585 Hs.199160 HG2367-HT2463 Trithorax Homolog Hrx 2.6 

104965 Hs.30177 AA084104 ESTs 2.6 

117711 Hs.46485 N45201 EST 2.6 

124792 Hs.48712 R44357 ESTs 2.6 

111299 Hs.74313 N73808 ESTs 2.6 

103616 Hs.32971 Z46973 phosphoinositide-3-kinase; class 3 2.6 

133629 Hs.195614 D13642 KIAA0017 gene product 2.6 

126484 Hs.169977 AI086782 ESTs 2.6 

100858 HG4245-HT4515 Forkhead Family Afx1 2.6 

133547 Hs.301927 X02883 T-cell receptor; alpha (V;D;J;C) 2.6 

126680 Hs.133865 F07097 ESTs 2.6 

125739 Hs.92137 AA428557 v-myc avian myelocytomatosis viral oncog 2.6 

102276 Hs.10247 U30999 Human (memc) mRNA, 3'UTR 2.6 

105586 Hs.191538 AA279137 ESTs 2.6 

103978 Hs.34136 AA307443 ESTs 2.6 

125054 Hs.268601 T80622 ESTs; Weakly similar to (defline not ava 2.6 

114212 Hs.21201 Z39338 ESTs; Highly similar to (defline not ava 2.6 

116959 Hs.40022 H79310 EST 2.6 

109228 Hs.306995 AA193366 ESTs 2.6 

133989 Hs.78202 U29175 SWI/SNF related; matrix associated; acti 2.6 

100640 Hs.182183 HG2743-HT2845 Caldesmon 1, Alt. Splice 3, Non-Muscle 2.6 

133093 Hs.285996 AA598749 ESTs 2.6 

114306 Hs.6540 Z40861 ESTs 2.6 

106060 Hs.171391 AA417287 C-terminal binding protein 2 2.5 

107748 Hs.60772 AA017258 EST 2.5 

100134 Hs.49 D13264 macrophage scavenger receptor 1 2.5 

133959 Hs.78 U13044 GA-binding protein transcription factor; 2.5 

130992 Hs.74316 AA455001 ESTs 2.5 

127493 Hs.291701 AA808081 oc39a08.s1 NCI_CGAP_GCB1 Homo sapiens cD 2.5 

132859 Hs.203961 N26855 ESTs 2.5 

117570 Hs.44583 N34415 EST 2.5 

124644 Hs.109654 N91279 ESTs 2.5 

103558 Hs.2785 Z19574 keratin 17 2.5 

132883 Hs.5897 AA047151 ESTs 2.5 

102009 Hs.82643 U02680 protein tyrosine kinase 9 2.5 

116058 Hs.20159 AA454156 ESTs 2.5 

121989 Hs.193784 AA430044 ESTs 2.5 

131257 Hs.24908 AA256042 ESTs 2.5 

100320 Hs.75275 D50916 homolog of yeast (S. cerevisiae) ufd2 2.5 

102959 Hs.121524 X15722 glutathione reductase 2.5 

132969 Hs.6166 AA047616 ESTs 2.5 

130869 Hs.2057 AA128100 uridine monophosphate synthetase (orotat 2.5 

129645 Hs.118131 L38928 5;10-methenyltetrahydrofolate synthetase 2.5 
Hs.83883 AA128075 zl16d00 M Soares_pregnant_uterus_NbHPU 2.5 

Hs.78935 U29607 Homo sapiens eIF-2-associated p67 homolo 

109816 Hs.61960 F11013 ESTs; Weakly similar to KIAA0176 [H.sapi 2.5 

134801 Hs.89695 X02160 insulin receptor 2.5 

104232 Hs.10587 AB002351 Human mRNA for KIAA0353 gene; partial cd 2.5 

107361 Hs.159436 U72513 Human RPL13-2 pseudogene mRNA; complete 2.5 

106057 Hs.289074 AA417067 ESTs 2.5 

134252 Hs.80720 AA031782 Homo sapiens mRNA; cDNA DKFZp586B1722 (f 2.5 

128062 Hs.105547 AA379500 ESTs 2.5 

110009 Hs.6614 H10933 ESTs 2.5 

111375 Hs.20432 N93695 ESTs 2.5 

122642 Hs.99361 AA454186 ESTs 2.5 

127999 Hs.69851 AA837495 ESTs; Weakly similar to Wiskott-Aldrich 2.5 

105029 Hs.13268 AA126855 ESTs 2.5 

105082 Hs.26765 AA143763 ESTs; Weakly similar to Similarity to S. 2.5 
TABLE 1A shows the accession numbers for those primekeys lacking unigeneID's for Table 
1. For each probeset we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 
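The grouping of sequences into gene clusters by sequence similarity can be illustrated with a minimal sketch. This is not the DoubleTwist Clustering and Alignment Tools procedure, which is not described here; it assumes a toy similarity measure (Jaccard overlap of k-mers), a hypothetical threshold, and made-up sequence labels, and simply joins any two sequences whose similarity meets the threshold (single-linkage clustering).

```python
from itertools import combinations


def kmers(seq, k=4):
    """Return the set of length-k substrings of a sequence."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def similarity(a, b, k=4):
    """Jaccard overlap of the k-mer sets of two sequences (toy measure, an assumption)."""
    ka, kb = kmers(a, k), kmers(b, k)
    if not ka or not kb:
        return 0.0
    return len(ka & kb) / len(ka | kb)


def cluster(seqs, threshold=0.5, k=4):
    """Single-linkage clustering with a union-find structure.

    seqs maps a label to a sequence; the threshold is hypothetical.
    """
    parent = {name: name for name in seqs}

    def find(x):
        # Follow parent links to the cluster representative, compressing the path.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in combinations(seqs, 2):
        if similarity(seqs[a], seqs[b], k) >= threshold:
            parent[find(a)] = find(b)

    groups = {}
    for name in seqs:
        groups.setdefault(find(name), []).append(name)
    return sorted(sorted(g) for g in groups.values())


# Hypothetical sequences keyed by made-up labels (not real accession numbers).
example = {
    "acc1": "ACGTACGTGGA",
    "acc2": "ACGTACGTGGT",
    "acc3": "TTTTCCCCAAAA",
}
clusters = cluster(example, threshold=0.5)
```

In this toy input the first two sequences share most of their 4-mers and fall into one cluster, while the third stays alone; each resulting cluster plays the role of one row of accession numbers in the table.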



Unique Eos probeset identifier number 
Gene cluster number 
Genbank accession numbers 



108552 111555_1 AA071210 AA069899 AA071438 AA084912 AA084803 AA079371 AA079370 

126023 1596090_1 H57661 H58881 

126086 1606216_1 H75681 H70975 

102565 32479_1 AB010994 U59748 AA064660 

101964 48158_-7 S81578 

125499 1562851_1 H10543 R11878 

125596 1708455_1 R25698 R56582 R56018 

118417 37186_1 AF080229 AF080231 AF080230 AF080232 AF080233 AF080234 BE550633 AI636743 AW614951 BE467547 AI680833 
AI633818 N29986 U87592 U87593 U87590 U87591 S46404 U87587 AA463992 AW206802 AI970376 AI583718 AI672574 
N25695 AW665466 AI818326 AA126128 AI430345 AW013827 AA248638 AI214968 AA204735 AA207155 AA206262 
AA204833 AW003247 AW496808 AI080480 AI631703 AI651023 AI867418 AW818140 AA502500 AI206199 AI671282 
AI352545 BE501030 AI652535 BE465762 AA206331 AW451866 AA471088 AA206342 AA204834 AA206100 AW021661 
AA332922 N66048 AA703396 H92278 AW139734 H92683 U87589 U87595 H69001 U87594 BE466420 AI624817 
BE466611 AI206344 AA574397 AA348354 AI493192 

125661 327827_1 AA491830 R50173 R55192 R50320 AI732306 AI732305 AI820727 AI820728 R55191 R50319 R50227 

125957 1583542_1 H41694 H45213 

125982 1766315_1 R98091 W92898 

127248 227560_1 AA364195 AA325029 AW962050 

103731 112052_1 AA070545 AA131490 AA131373 

127261 231687_1 AA330501 AA661567 

127265 232391_1 AA331503 AA332751 AW962542 

126659 1541209_1 T16245 R19694 F13545 H10299 T66048 T65279 H18006 

127315 37938_1 AF116622 AI114507 AA640834 AA377999 

103806 112618_1 AA130614 AA071410 

128104 502608_1 AA906093 AA971000 

104602 524482_2 H47610 R86920 



7 219802 -2 
9 158447 1 
9 genbank_AA608983 
5 genbank_AA180496 
9 tigr_HT4163 sefwaa 

8 tigr_HT4515 U10072 



T77794 T85681 
AA773681 AA773857 

BE298210 AI672315 AW086489 BE298417 AA455921 AA902537 BE327124 R14963 AA085210 AW274273 AI333584 
AI369742 AI039658 AI885095 AI476470 AI287650 AI885299 AI985381 AW592624 AW340136 AI266556 AA456390 
AI310815 AA484951 

AI950087 N70208 R97040 N36809 AI308119 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 
AW967671 AA908598 AA251875 AI820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 
AA884922 BE328517 AI219788 AA884444 N92578 F13493 AA927794 AI560251 AW874068 AL134043 AW235363 
AA663345 AW008282 AA488964 AA263144 AI890387 AI950344 AI741346 AI689062 AA282915 AW102898 AI872193 
AI763273 AW173586 AW150329 AI653832 AI762688 AA988777 AA488892 AI356394 AW103813 AI539642 AA642789 
AA856975 AW505512 AI961530 AW629970 BE612881 AW276997 AW513601 AW512843 AA044209 AW855538 
AA180009 AA337499 AW961101 AA251669 AA251874 AI819225 AW205862 AI683338 AI858509 AW276905 AI633006 
AA972584 AA908741 AW072629 AW513996 AA293273 AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 
AA780419 AA551005 W80701 AW613456 AI373032 AI564269 F00531 H83488 W37181 W78802 R66056 AI002839 
R67840 AA300207 AW959581 T63226 F04005 
AA487961 

AA178953 AA192740 
AA608983 
AA180496 
123798 579959_1 AA620411 AA287491 

102116 entrez_U13706 U13706 

102398 entrez_U42359 U42359 

102764 entrez_U82310 U82310 

118475 genbank_N66845 N66845 

104776 genbank_AA026349 AA026349 

104787 genbank_AA027317 AA027317 

113702 genbank_T97307 T97307 

113938 genbank_W81598 W81598 

122635 genbank_AA454085 AA454085 

108407 genbank_AA075519 AA075519 

108432 genbank_AA076626 AA076626 

108555 genbank_AA084963 AA084963 

101349 entrez_L77559 L77559 

124447 genbank_N48000 N48000 

119071 genbank_R31180 R31180 

103520 entrez_Y10511 Y10511 

103663 genbank_Z78291 Z78291 

123465 genbank_AA599033 AA599033 

128046 877605_1 AA873285 AI025762 

126959 546044_1 AA199853 AA206355 
TABLE 2 shows a preferred subset of the Accession numbers for genes found in Table 1 
which are differentially expressed in prostate tumor tissue compared to normal prostate 



Unique Eos probeset identifier number 

Exemplar Accession number, Genbank accession number 

Unigene number 

Unigene gene title 

Ratio of tumor to normal body tissue (Relaxed ratio (87/70)) 



Pkey ExAccn UnigeneID Unigene Title 

131919 AA121266 Hs.272458 ESTs 

120328 AA196979 Hs.290905 ESTs; Weakly similar to (defline not ava 

101486 M24902 Hs.1852 acid phosphatase; prostate 

119073 R32894 Hs.279477 ESTs 

133428 M34376 Hs.183752 microseminoprotein; beta- 

128180 AA595343 Hs.171995 kallikrein 3; (prostate specific antigen 

104080 AA402971 Hs.57771 Homo sapiens mRNA for serine protease (T 



131665 R 
101050 K 
130771 N 
107485 V 
106155 A 
129534 P 
100569 H 



133944 A 
130974 X 
114768 A 



Hs.1832 

Hs.1915 

Hs.262476 

Hs.33287 

Hs.11260 



Hs.99872 
Hs.7780 
Hs.2178 



118417 tv 
126758 
107102 
116787 
115719 
123209 
101664 
112971 
117984 



119517 
105627 
101461 



Hs.14846 
Hs.268744 
Hs.61635 
Hs.95420 
Hs.113314 



ESTs 

neuropeptide Y 

folate hydrolase (prostate-specific memb 
S-adenosylmethionine decarboxylase 1 
ESTs 
ESTs 



fetal Alzheimer antigen 
ESTs 

H2B histone family; member Q 
ESTs 



W37145 
AA609723 
H28581 
AA416997 
AA489711 
M60752 
T17185 
N51919 
M30894 
AA031360 
AA425887 
W47380 
AA281245 
M22430 



ESTs; Moderately similar to KIAA0273 [H. 
Homo sapiens BAC clone RG041D11 from 7q2 10.7 
Homo sapiens mRNA for JM27 protein; comp 10.6 
ESTs 10.6 
ESTs; Weakly similar to polymerase [H.sa 10.5 
ESTs 10.2 



Antigen, Prostate Specific, Alt. Splice 16 



11.8 
11.4 



Hs.15641 
Hs.59622 
Hs.203270 
Hs.121017 
Hs.83883 
Hs.106778 
Hs.274509 
Hs.167133 
Hs.98502 



ESTs 

H2A histone family; member A 

ESTs 

ESTs 

T-cell receptor; gamma cluster 



10.1 



9.2 



1 L22524 
3 M86546 

2 AA280036 



ESTs 8.9 

ESTs 8.8 

phospholipase A2; group IIA (platelets; 8.7 

yz61c5.s1 Soares_multiple_sclerosis_2NbH 8.5 

ESTs 8.2 

ESTs; Weakly similar to KIAA0319 [H.sapi 8.1 

ESTs 8 

CD38 antigen (p45) 8 

ESTs 7.6 

ESTs 7.4 

ESTs 7.1 

Human N33 protein form 1 (N33) gene, exo 7 

matrix metalloproteinase 7 (matrilysin; 6.9 

pre-B-cell leukemia transcription factor 6.8 

ESTs; Weakly similar to W01A6.C [C.elega 6.8 
109112 AA169379 

109795 F10707 

130336 X07730 

131425 AA219134 

132902 AA490969 

133724 U07919 

120215 Z41050 

131881 AA010163 

100727 X07290 

121770 AA421714 

123475 AA599267 

133061 AB000584 

116429 AA609710 

101233 L29008 

104691 AA011176 

127248 AA325029 

105500 AA256485 

130828 AA053400 

115357 AA281793 

116334 AA491457 

120132 Z38839 

106375 AA443993 

124777 R41933 

101791 M83822 

117698 N41002 

122041 AA431407 

133723 AA088851 

113938 W81598 

133015 AA047036 

108186 AA056482 

104466 N25110 

104033 AA365031 

110844 N31952 

129056 H70627 

133493 AA284143 

129184 W26769 

101448 M21389 

116188 AA464728 

105921 AA402613 

103375 X91868 

128871 AA400271 

116238 AA479362 

102913 X07696 

103011 X52541 

118981 N93839 



Hs.257924 
Hs.326416 
Hs.171995 
Hs.26691 



Hs.334786 
Hs.278428 
Hs.250528 
Hs.296638 



Hs.125019 
Hs.289072 
Hs.140237 



ESTs 6.8 

ESTs 6.7 

kallikrein 3; (prostate specific antigen 6.6 

ESTs 6.6 

ESTs 6.6 

aldehyde dehydrogenase 6 6.5 
Homo sapiens Mod4p homolog mRNA; complet 6.5 

upstream regulatory element binding prot 6.5 

Human HF.12 gene mRNA 6.3 

Homo sapiens mRNA for KIAA0896 protein; 6.3 

ESTs; Weakly similar to ANKYRIN; BRAIN V 6.3 

prostate differentiation factor 6.3 

ESTs; Weakly similar to similar to GTP-b 6.2 

sorbitol dehydrogenase 6.2 

ESTs 6.2 
EST27953 Cerebellum II Homo sapiens cDNA 6.2 

ESTs 6.1 

ESTs 5.9 

ESTs 5.8 

ESTs 5.7 

ESTs; Weakly similar to !!!! ALU SUBFAMI 5.6 

ESTs 5.6 

ESTs; Weakly similar to neuronal thread 5.6 

Human beige-like protein (BGL) mRNA; par 5.5 



5.5 

Homo sapiens Chromosome 16 BAC clone CIT 5.5 



Hs.7780 ESTs 



Hs.167531 
Hs.108336 
Hs.194369 
Hs.109201 
Hs.195850 
Hs.184598 
Hs.169119 
Hs.54416 
Hs.106778 
Hs.47144 
Hs.80342 
Hs.326035 



Human guanine nucleotide exchange factor 5.3 

ESTs 5.3 

ESTs; Weakly similar to (defline not ava 5.3 

ESTs; Weakly similar to !!!! ALU SUBFAMI 5.3 

Homo sapiens chromosome 1 atrophin-1 rel 5.3 

ESTs; Highly similar to (defline not ava 5.2 

keratin 5 (epidermolysis bullosa simplex 5.1 

ESTs; Weakly similar to !!!! ALU SUBFAMI 5.1 



sine oculis homeobox (Drosophila) homolo 
ESTs; Highly similar to (defline not ava 
ESTs 



5.1 
TABLE 2A shows the accession numbers for those primekeys lacking unigeneID's for Table 
2. For each probeset we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 
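Rows of this kind (probeset identifier, cluster or CAT number, then the accession numbers of the cluster) lend themselves to a simple lookup structure. The sketch below is illustrative only: the row layout is assumed to be whitespace-separated with one probeset per line, and the identifiers in the example are hypothetical placeholders rather than values from the table.

```python
def parse_cluster_table(lines):
    """Parse 'Pkey  CAT-number  accession ...' rows into a dictionary.

    Assumes one probeset per line with whitespace-separated fields.
    """
    table = {}
    for line in lines:
        fields = line.split()
        if len(fields) < 3:
            continue  # skip blank or incomplete rows
        pkey, cat_number, accessions = fields[0], fields[1], fields[2:]
        table[pkey] = {"cat": cat_number, "accessions": accessions}
    return table


def probesets_for_accession(table, accession):
    """Return the sorted Pkeys whose cluster lists the given accession."""
    return sorted(p for p, row in table.items() if accession in row["accessions"])


# Hypothetical rows; PKEY_*, CAT_* and ACC_* are placeholders, not table values.
rows = [
    "PKEY_1 CAT_10 ACC_A ACC_B ACC_C",
    "",
    "PKEY_2 CAT_11 ACC_D",
    "PKEY_3 CAT_12 ACC_B ACC_E",
]
table = parse_cluster_table(rows)
hits = probesets_for_accession(table, "ACC_B")
```

Keeping the accession list per probeset makes it possible to go from any member sequence of a cluster back to the probeset designed from that cluster.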



Unique Eos probeset Identifier number 



CAT number Accession 



118417 37186_1 
AF080229 AF080231 AF080230 AF080232 AF080233 AF080234 BE550633 AI636743 AW614951 BE467547 AI680833 
AI633818 N29986 U87592 U87593 U87590 U87591 S46404 U87587 AA463992 AW206802 AI970376 AI583718 AI672574 
N25695 AW665466 AI818326 AA126128 AI480345 AW013827 AA248638 AI214968 AA204735 AA207155 AA206262 
AA204833 AW003247 AW496808 AI080480 AI631703 AI651023 AI867418 AW818140 AA502500 AI206199 AI671282 
AI352545 BE501030 AI652535 BE465762 AA206331 AW451866 AA471088 AA206342 AA204834 AA206100 AW021661 
AA332922 N66048 AA703396 H92278 AW139734 H92683 U87589 U87595 H69001 U87594 BE466420 AI624817 
BE466611 AI206344 AA574397 AA348354 AI493192 

AA364195 AA325029 AW962050 

AI141999 AA730176 R44544 R41778 AW300793 AW966157 AA918501 AA599629 AI082195 AI198537 AW006520 
TABLE 3 shows genes, including expressed sequence tags, differentially expressed in 
prostate tumor tissue compared to normal tissue as analyzed using the Affymetrix/Eos Hu02 
GeneChip array. Shown are the relative amounts of each gene expressed in prostate tumor 
samples and various normal tissue samples showing the highest expression of the gene. 
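A tumor-to-normal ratio of the kind tabulated here can be sketched as follows. The exact computation behind the table is not stated in this section, so the sketch makes explicit assumptions: each probeset has a list of tumor intensities and a list of normal-tissue intensities, each group is summarized by its median, values below a hypothetical floor are raised to the floor to avoid division by zero, and the selection cutoff is arbitrary.

```python
from statistics import median


def tumor_normal_ratio(tumor, normal, floor=1.0):
    """Ratio of median tumor intensity to median normal intensity.

    The median summary and the floor value are assumptions of this sketch.
    """
    t = max(median(tumor), floor)
    n = max(median(normal), floor)
    return t / n


def select_overexpressed(profiles, min_ratio=2.5):
    """Return (probeset, ratio) pairs at or above min_ratio, highest first.

    profiles maps a probeset label to (tumor values, normal values).
    """
    selected = []
    for pkey, (tumor, normal) in profiles.items():
        ratio = tumor_normal_ratio(tumor, normal)
        if ratio >= min_ratio:
            selected.append((pkey, round(ratio, 1)))
    return sorted(selected, key=lambda item: item[1], reverse=True)


# Hypothetical intensities; the probe labels are placeholders.
profiles = {
    "probe_a": ([100, 120, 140], [10, 20, 30]),
    "probe_b": ([50, 60], [40, 50]),
    "probe_c": ([5, 5], [0, 0]),
}
ranked = select_overexpressed(profiles)
```

Sorting the selected probesets by descending ratio mirrors the ordering of the ratio column in the tables.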



Unique Eos probeset Identifier number 

Exemplar Accession number, Genbank accession number 

Unigene number 

Unigene gene title 

Ratio of tumor to normal body tissue 



Pkey ExAccn UnigenelD Unigene Title 



3 HG2261-HT2352 
3 HG4020-HT4290 
3 L00354 Hs.80247 



phosphodiesterase I/nucleotide pyrophosp 

KIAA0056 protein 

Hs.171995 

Hs.2387 



101416 h 

101447 f, 

101485 I. 

101514 I 

101626 fv 

101663 f. 

101758 h 

101768 h 

101817 f. 



glycogen synthase kinase 3 beta 



Human alpha satellite and satellite 3 ju 
selectin E (endothelial adhesion molecul 
2 RAB3B; member RAS oncogene family 
pleiotrophin (heparin binding growth fac 
H2B histone family; member A 
pyrroline-5-carboxylate reductase 1 



3 U37519 
7 U48807 
3 U49957 
9 U71207 
3 U75272 
1 U80034 
3 U90914 
9 X02544 
1 X54567 



Hs.95243 
Hs.2156 
Hs.505 
Hs.3844 
Hs.79334 
Hs.69171 
Hs.87539 
Hs.2359 
Hs.180398 
Hs.29279 
Hs.1867 
Hs.68583 
Hs.5057 
Hs.572 
Hs.123114 
Hs.93379 
Hs.44926 



RAR-related orphan receptor A 
ISL1 transcription factor; LIM/homeodoma 
LIM domain only 4 
nuclear factor; interleukin 3 regulated 



aldehyde dehydrogenase 8 

dual specificity phosphatase 4 

LIM domain-containing preferred transloc 

eyes absent (Drosophila) homolog 2 

progastricsin (pepsinogen C) 

mitochondrial intermediate peptidase 



Antigen, Prostate Specific, Alt. Splice 
Transglutaminase 10.5 



cystatin S 

eukaryotic translation initiation factor 
3 X60708 Hs.44926 dipeptidylpeptidase IV (CD26; adenosine 
coated vesicle membrane protein 
103401 X95240 Hs.54431 specific granule protein (28 kDa); cyste 

SRY (sex-determining region Y)-box 9 (ca 
103677 Z83806 H.sapiens mRNA for axonemal dynein heavy 

AA298180 Hs.83243 ESTs 
AA410529 Hs.30732 ESTs 
AF006265 Hs.9222 
D45332 Hs.6783 



1 AA040882 

3 AA054228 

3 AA074880 

7 AA074919 



ESTs 

3 ESTs; Weakly similar to !!!! ALU SUBFAMI 
Hs.10290 U5 snRNP-specific 40 kDa protein (hPrp8- 



Hs.20509 
Hs.10026 
Hs.291000 
Hs.23729 
Hs.26369 



ESTs; Weakly similar to hypothetical pro 
ESTs; Weakly similar to ORF YJL063C [S.c 



Homo sapiens clone 24405 mRNA sequence 
105304 AA233553 Hs.190325 ESTs 4.7 

105370 AA236476 Hs.22791 ESTs; Weakly similar to transmembrane pr 

105427 AA251330 Hs.28248 ESTs 

105542 AA261858 Hs.266957 ESTs; Weakly similar to heat shock prote 

105628 AA281251 Hs.79828 ESTs; Weakly similar to putative zinc fi 

105640 AA281623 Hs.6685 ESTs; Weakly similar to KIAA0742 protein 8 

105645 AA282138 Hs.11325 ESTs 14 

105691 AA287097 Hs.289068 transcription factor 4 6.3 

105730 AA292701 Hs.5364 DKFZP564I052 protein 4.9 

105808 AA393808 Hs.286131 KIAA0438 gene product 7 

105826 AA398243 Hs.194477 ESTs; Moderately similar to similar to N 5 

105903 AA401433 Hs.200016 ESTs; Weakly similar to diphosphoinosito 9.9 

105906 AA401633 Hs.22380 ESTs 11.5 

106065 AA417558 Hs.25206 ESTs 5.1 

106094 AA419461 Hs.23317 ESTs 10.9 

106157 AA425367 Hs.34892 ESTs 6.6 

106184 AA426643 Hs.10762 ESTs 8.5 

106211 AA428240 Hs.126083 ESTs 8.4 

106213 AA428258 Hs.8769 Homo sapiens mRNA; cDNA DKFZp554E153 (fr 5.7 

106272 AA432074 Hs.323099 ESTs 5.8 

106369 AA443828 Hs.288856 ESTs 6.3 

106400 AA447621 Hs.94109 ESTs 5.4 

106474 AA450212 Hs.42484 Homo sapiens mRNA; cDNA DKFZp564C053 (fr 9.2 

106507 AA452584 Hs.267819 protein phosphatase 1; regulatory (inhib 5.6 

106523 AA453441 Hs.31511 ESTs 4.7 

106532 AA453628 Hs.37443 ESTs 4.7 

106557 AA455087 Hs.22247 ESTs 5.7 

106575 AA456039 Hs.105421 ESTs 7.2 

106618 AA459249 Hs.8715 ESTs; Weakly similar to Similarity with 5.6 

106820 AA481037 Hs.12592 ESTs 5.4 

106846 AA485223 Hs.34892 ESTs 5.3 

106973 AA505141 Hs.11923 Human DNA sequence from clone 167A19 on 7.5 

107110 AA609952 Hs.12784 KIAA0293 protein 6.1 

107127 AA620504 Hs.179898 ESTs 7.1 

107159 AA621340 Hs.10600 ESTs; Weakly similar to ORF YKR081c [S.c 5.2 

107217 D51095 Hs.35861 DKFZP586E1621 protein 15.1 

107365 U78294 Hs.111256 arachidonate 15-lipoxygenase; second typ 4.7 

107630 AA007218 Hs.60178 ESTs 5.3 

107734 AA016225 Hs.7517 ESTs 4.8 

107760 AA018042 Hs.252085 EST 7.6 

107997 AA037388 Hs.82223 Human DNA sequence from clone 141H5 on c 10.5 

108012 AA038616 Hs.173334 ESTs 6.5 

108520 AA084138 Hs.46786 ESTs 7.9 

108583 AA088276 Hs.68826 ESTs 5.6 

108613 AA100967 Hs.69165 ESTs 6 

108664 AA113349 Hs.69588 EST 6.3 

108677 AA115629 Hs.118531 ESTs 5.9 

108807 AA129968 Hs.49376 ESTs; Weakly similar to PROTEIN PHOSPHAT 5.8 

108910 AA136590 ESTs 5 

108933 AA147224 Hs.337232 ESTs 12.7 

108948 AA149579 Hs.118258 ESTs 6.8 

109014 AA156790 Hs.262036 ESTs 15.3 

109124 AA171529 Hs.183887 ESTs 6.1 

109142 AA176438 Hs.41295 ESTs 5.1 

109277 AA196332 Hs.86043 ESTs 5.5 

109342 AA213620 Homo sapiens mRNA; cDNA DKFZp586M1418 (f 6 

109562 F01811 Hs.187931 ESTs; Moderately similar to voltage-gate 10.8 

109565 F01930 Hs.23648 ESTs 7 

109648 F04600 Hs.7154 ESTs 5.9 

109799 F10770 Hs.180378 Homo sapiens clone 669 unknown mRNA; com 6.4 

109859 H02308 Hs.20792 ESTs 5.3 

110181 H20276 Hs.31742 ESTs 16.8 

110854 N32919 Hs.27931 ESTs 10 

110924 N47938 Hs.12940 yy84a09.s1 Soares_multiple_sclerosis_2Nb 5.6 

111046 N55514 Hs.318584 ESTs 6.9 

111091 N59858 Hs.33032 Homo sapiens mRNA; cDNA DKFZp434N185 (fr 5.2 

111157 N66613 Hs.99364 ESTs 5 

111164 N66857 Hs.122489 ESTs; Weakly similar to !!!! ALU CLASS C 5.6 

111221 N68869 Hs.15119 ESTs 6.2 
111348 N90041 Hs.9585 ESTs 5.4 

111353 N90430 Hs.6616 ESTs 5.3 

111495 R07210 Hs.9683 ESTs 5.8 

111540 R08850 Hs.9786 ESTs 6 

111579 R10657 Hs.167115 KIAA0830 protein 12.6 

111581 R10684 Hs.5794 ESTs 7.1 

111734 R25375 Hs.128749 ESTs 6.2 

111861 R37460 Hs.25231 ESTs 9.4 

111870 R37778 Hs.18685 ESTs; Weakly similar to hypothetical pro 6.5 

111937 R40431 Hs.14846 Homo sapiens mRNA; cDNA DKFZp564D016 (fr 4.8 

111987 R42036 Hs.6763 KIAA0942 protein 6.4 

112184 R49173 Hs.330242 ESTs 5.6 

112286 R53765 Hs.158135 KIAA0981 protein 9.3 

112380 R59740 Hs.5740 ESTs 4.7 

112452 R63841 Hs.157461 ESTs 6 

112601 R79111 Hs.78225 annexin A1 5.4 

112753 R93696 Hs.169882 ESTs 5.8 

112902 T09262 Hs.129190 ESTs 5.1 

112984 T23457 Hs.289014 ESTs 4.9 

113021 T23855 Hs.129836 KIAA1028 protein 10.8 

113083 T40530 Hs.266957 ESTs; Weakly similar to heat shock prote 5.7 

113200 T57773 Hs.10263 ESTs 7.3 

113494 T88878 Hs.86538 ESTs 8.7 

113849 W60439 Hs.8858 ESTs; Moderately similar to cbp146 [M.mu 4.9 

113883 W72382 Hs.11958 oxidative 3 alpha hydroxysteroid dehydro 4.7 

113950 W85765 Hs.30504 Homo sapiens mRNA; cDNA DKFZp434E082 (fr 6.7 

113986 W87462 Hs.21894 ESTs 5.9 

113989 W87544 Hs.268828 ESTs 4.7 

114124 Z38595 Hs.125019 ESTs; Highly similar to KIAA0886 protein 21.3 

114340 Z41395 Hs.143611 ESTs 9.6 

114346 Z41450 Hs.130489 ESTs 5.2 

114435 AA018216 Hs.164975 Bicaudal D (Drosophila) homolog 1 7.4 

114463 AA025370 Hs.40109 KIAA0872 protein 8.2 

114652 AA101416 Hs.107149 ESTs; Weakly similar to PTB-ASSOCIATED S 5.4 

35 114721 AA131450 Hs.103822 ESTs 4.8 

114730 AA133527 HS.33132B ESTs;WeaklysimilartoTheKIAA0138gen 5.1 

114833 AA234362 Hs.87159 ESTs; Moderately similar to CGI-66 prote 5.5 

114860 AA235112 Hs.42179 ESTs; Moderately similar to similar to m 6.3 

114884 AA235811 Hs.293672 ESTs 5.2 

40 114895 AA235177 Hs.76591 KIAA0887 protein 4.7 

114908 AA236545 Hs.54973 ESTs 5.2 

114932 AA242751 Hs.16218 KIAA0903 protein 5.7 

115084 AA255566 Hs.42484 Homo sapiens mRNA; oDNA DKFZp564C053 (fr 5.2 

115140 AA258030 Hs.279938 ESTs; Weakly similar to supported by GEN 5.9 

45 115468 AA287061 Hs.48499 ESTs; Highly similar to Bdeight protein 4.7 

115583 AA398913 Hs.45231 LDOC1 protein 7.6 

115709 AA412519 Hs.58279 ESTs 4.8 

115772 AA423972 Hs.131740 ESTs 5 

115774 AA424029 Hs.288390 ESTs; Moderately similar to dynamin; int 5.4 

50 115776 AA424038 Hs.81897 ESTs 5 

115821 AA427528 Hs.130965 ESTs; Weakly similar to ZINC FINGER PROT 13.7 

115955 AA446121 Hs.44198 Homo sapiens BAC clone RG054D04 from 7q3 10.6 

116024 AA451748 Hs.83883 Human DNA sequence from clone 71 8J7 one 6.8 - 

116108 AA457566 Hs.28777 ESTs 6 

55 116117 AA459117 Hs.31575 SEC63; endoplasmic reticulum translocon 7.3 

116146 AA460701 Hs.15423 ESTs 5.5 

116296 AA489033 Hs.62601 Homo sapiens mRNA; cDNA DKFZp586K1318 (f 5.7 

116379 AA521472 Hs.71252 ESTs 5.9 

116393 AA599463 Hs.306051 protein phosphatase 2 (formerly 2A); reg 5.9 

60 116401 AA599963 Hs.59698 ESTs 7.9 

116416 AA609219 Hs.39982 ESTs 9.2 

116587 D59325 Hs.121429 ESTs 5.2 

116601 D80055 Hs.45140 ESTs 4.9 

116684 F09156 Hs.66095 ESTs 7.2 

65 116722 F13654 HSFIH32 Stratagene cat#937212 (1992) Horn 5.5 

116766 H13260 Hs.95097 ESTs 5.9 

117453 N29568 Hs.108319 thyroid hormone receptor-associated prat 6.9 

117557 N33920 Hs.44532 diubiquitin 4.8 

117708 N45114 Hs.126280 ESTs 6.3 
118001 N52151 Hs.47447 ESTs 11.4
118229 N62339 Hs.166254 heat shock 90kD protein 1; alpha 6.2
118599 N69207 Hs.203697 ESTs 5.8
118645 N70358 Hs.125180 growth hormone receptor 7.1
118873 N69861 Hs.44577 ESTs 6
118985 N94303 Hs.55028 ESTs 9.3
119107 R42424 Hs.63841 ESTs 6
119126 R45175 Hs.117183 ESTs 17.9
119271 T16387 Hs.65328 ESTs 6
119357 T78324 Hs.250895 ESTs 5
119721 W69440 Hs.48376 ESTs 15.4
119741 W70205 Hs.43670 kinesin family member 3A 10.1
119780 W72967 Hs.191381 ESTs; Weakly similar to hypothetical pro 5.3
120217 Z41078 Hs.66035 ESTs 4.8
120266 AA173939 Hs.205442 ESTs; Weakly similar to inner centromere 8.8
120294 AA190888 Hs.153881 ESTs; Highly similar to NY-REN-62 antige 4.9
120418 AA236010 Hs.26613 Homo sapiens mRNA; cDNA DKFZp586F1323 (f 4.7
120486 AA253400 Hs.137569 tumor protein 63 kDa with strong homolog 5.6
120524 AA261852 Hs.192905 ESTs 4.9
120571 AA280738 Hs.34892 ESTs 8.8
120596 AA282074 Hs.237323 ESTs 6.2
120713 AA292655 Hs.96557 ESTs 9.9
120992 AA398246 Hs.97594 ESTs 16.4
121429 AA406293 Hs.41167 ESTs 6.9
121503 AA412049 Hs.290347 ESTs 7.6
121512 AA412105 Hs.193736 ESTs 5.8
121816 AA424814 Hs.48827 ESTs 4.6
122027 AA431302 Hs.98721 EST; Weakly similar to N-copine [H.sapie 5.6
122294 AA437311 Hs.98927 ESTs 5.7
122411 AA446859 Hs.99083 ESTs 6.5
122791 AA460158 Hs.129836 KIAA1028 protein 12.4
122792 AA460225 Hs.99519 ESTs 5.1
122969 AA478539 Hs.104336 ESTs 4.9
123095 AA485724 Hs.27413 ESTs 5.4
123100 AA485957 Hs.306219 Homo sapiens clone 25032 mRNA sequence 5
123295 AA495981 Hs.250830 ESTs 4.7
123311 AA496252 Hs.105069 ESTs 7.4
123583 AA609006 Hs.111240 ESTs 9.1
123619 AA609200 ESTs 4.7
123645 AA609310 Hs.188691 ESTs 4.8
123709 AA609651 Hs.112742 ESTs 7
123938 C14333 Hs.108327 damage-specific DNA binding protein 1 (1 5
124178 H45996 Hs.97101 putative G protein-coupled receptor 6.8
124352 N21626 Hs.102406 ESTs 102
124357 N22401 yw37g07.s1 Morton Fetal Cochlea Homo sap 10.6
124515 N58172 Hs.109370 ESTs 142
124911 R88992 Hs.174195 ESTs 4.8
125154 W38419 ESTs 4.7
125992 W01626 za36e07.r1 Soares fetal liver spleen 1NF 5.1
126802 AA947601 Hs.97056 ESTs 5.1
126812 Z36290 Hs.173933 ESTs; Weakly similar to NUCLEAR FACTOR 1 4.6
127080 AA662913 Hs.190173 ESTs 5
127308 AA507628 Hs.334390 ESTs 4.8
127370 AI024352 Hs.70337 immunoglobulin superfamily; member 4 4.7
127386 AI457411 Hs.106728 ESTs 4.8
127965 AA828760 Hs.292059 ESTs 4.8
128172 AI400862 Hs.265130 ESTs 5
128305 AI039722 Hs.279009 ESTs 5.8
128420 AI088155 Hs.41296 ESTs; Weakly similar to unknown [H.sapie 17
128467 AA176446 Hs.180428 ESTs; Weakly similar to hypothetical 43. 4.8
128610 L38608 Hs.10247 activated leucocyte cell adhesion molecu 7.9
128625 AA242816 Hs.102652 ESTs; Weakly similar to KIAA0437 [H.sapi 8.1
128651 AA446990 Hs.103135 ESTs 6.5
129088 AA215971 Hs.194431 KIAA0992 protein 5.2
129136 N26391 Hs.250723 ESTs 5.1
129171 AA234048 Hs.7753 calumenin 5.8
129229 AA211941 Hs.109643 polyadenylate binding protein-interactin 5.8
129386 N27524 Hs.260024 Cdc42 effector protein 3 52
129467 AA410311 Hs.44208 ESTs 5.1
129564 H22136 Hs.75295 guanylate cyclase 1; soluble; alpha 3 16.3
129699 AA458578 Hs.12017 KIAA0439 protein; homolog of yeast ubiqu 9.2
129821 F11019 Hs.12696 cortactin SH3 domain-binding protein 8.6
129823 X00948 Hs.105314 relaxin 2 (H2) 9.1
129847 W46767 Hs.296178 ESTs; Weakly similar to RNA POLYMERASE I 5.4
129912 AA047344 Hs.107213 ESTs; Highly similar to NY-REN-6 antigen 6.5
129958 L20591 Hs.1378 annexin A3
129977 J04076 Hs.1395 early growth response 2 (Krox-20 (Drosop 8.8
130061 U82256 Hs.172851 arginase; type II 7.4
130241 U78313 Hs.153203 MyoD family inhibitor 4.9
130466 N21679 Hs.180059 ESTs 5.8
130541 X05608 Hs.211584 neurofilament; light polypeptide (68kD) 6.7
130619 AA477739 Hs.12532 ESTs 6.4
130925 N71935 Hs.169378 multiple PDZ domain protein 7.9
130938 AA013250 Hs.21398 ESTs; Moderately similar to PUTATIVE GLU 6.2
Hs.301444 signal sequence receptor; gamma (translo 6.4
131066 F09006 Hs.22588 ESTs 5
131126 F09012 Hs.181326 myotubularin related protein 2 6.4
131310 J02960 Hs.2551 adrenergic; beta-2-; receptor; surface 7.9
131487 AA253220 Hs.27373 Homo sapiens mRNA; cDNA DKFZp56401763 (f 5.9
131561 X59841 Hs.294101 pre-B-cell leukemia transcription factor 7.6
131562 U90551 Hs.28777 H2A histone family; member L 5.1
131579 N62922 Hs.29088 ESTs
131629 AA442119 Hs.238809 ESTs 4.9
131682 AA428368 Hs.30654 ESTs 4.8
131699 R68657 Hs.90421 ESTs; Moderately similar to !!!! ALU SUB 6.5
131795 N32724 Hs.32317 Sox-like transcriptional factor 5.6
132053 H93381 Hs.38085 ESTs; Weakly similar to putative glycine 12
132122 U65092 Hs.40403 Cbp/p300-interacting transactivator; wit 5.6
132191 AA449431 Hs.288361 KIAA0741 gene product 8
132256 AA608856 Hs.431 murine leukemia viral (bmi-1) oncogene h 5.5
132482 AA429478 Hs.238126 ESTs; Highly similar to CGI-49 protein 1 6.6
132533 AA021608 Hs.172510 ESTs 5.8
132572 AA448297 Hs.237825 signal recognition particle 72kD 6.2
132581 R42266 Hs.52256 ESTs; Weakly similar to beta-TrCP protei 16
132700 N47109 Hs.5521 ESTs 6.8
132701 AA279359 Hs.55220 BCL2-associated athanogene 2 5.3
132725 L41887 Hs.184167 splicing factor; arginine/serine-rich 7 7.8
132783 N74897 Hs.278894 DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 5.9
132790 X75535 8
132939 U76189 Hs.61152 exostoses (multiple)-like 2 5.2
133142 F03321 Hs.65874 ESTs 52
133342 U29589 Hs.7138 cholinergic receptor; muscarinic 3 10.3
133434 AA278852 Hs.30212 ESTs 5.8
Hs.73826 protein tyrosine phosphatase; non-recept 4.9
133520 X74331 Hs.74519 primase; polypeptide 2A (58kD) 13.1
133544 T33873 Hs.74624 protein tyrosine phosphatase; receptor t 4.6
133608 D13315 Hs.75207 glyoxalase 1 4.8
133626 H75939 Hs.75277 Homo sapiens mRNA; cDNA DKFZp586M141 (fr 5
133633 D21262 Hs.75337 nucleolar phosphoprotein p130 6.3
133797 S66431 Hs.76272 retinoblastoma-binding protein 2 6
133928 N34096 Hs.7766 ubiquitin-conjugating enzyme E2E 1 (homo 5.4
134095 U47414 Hs.79069 cyclin G2 5.2
Hs.80667 RALBP1 associated Eps domain containing 6.5
Hs.8172 ESTs 7
134453 X70683 Hs.83484 SRY (sex determining region Y)-box 4 4.7
134542 X57025 Hs.85112 insulin-like growth factor 1 (somatomedi 7.7
134570 U66615 Hs.172280 SWI/SNF related; matrix associated; act 6.4
134592 U82613 Hs.289104 Alu-binding protein with zinc finger dom 5.4
134654 W23625 Hs.8739 ESTs; Weakly similar to ORF YGR200C [S.c 5
134666 AA482319 Hs.8752 putative type II membrane protein 5.4
134806 Z49099 Hs.89718 spermine synthase 6.7
134951 AA431480 Hs.169358 ESTs 9.8
135066 X04602 Hs.93913 interleukin 6 (interferon; beta 2) 5.7
135155 AA358268 Hs.166556 ESTs; Moderately similar to transcriptio 4.9
135411 L10333 Hs.99947 reticulon 1 5.3
300023 M10098 AFFX control: 18S ribosomal RNA 4.6
300254 AW079607 Hs.55610 ESTs; Weakly similar to ZnT-3 [H.sapiens 7.8
300273 AW013907 Hs.167531 ESTs; Moderately similar to predicted us
300319 AW157646


hwm? ESTs; Weakly simiar to miwotu »a«i 




300566 H 86709 




58 


300578 AI989417 


HS.134289 ESTS 










300675 AA039352 


Hs 1 25034 ESTs; Weakly similar to ORF YDL040c [S .c 






Hs 248 1 7 ESTs; Weakly similar to Ki AA0986 protein 


















Hs.208341 ESTs; Weakly similar to KIAA0989 protein 






Hs 106823 ESTs; Weakly similar to putative zinc fi 




300834 AF1 09300 


Hs 147924 ESTs 




300923 AW1 36372 






300962 AA593373 


Hs.293744 ESTs 










301042 AI659131 


Hs.197733 ESTS 




301242 AW161535 


Hs.23782 ESTs 


11 8 


301254 A1049624 






301262 H29500 


Hs.7130 ESTs; Moderately similar to N-copine [H. 




301388 AA156879 


Hs 262036 ESTs; Weakly similar to ZINC FINGER PROT 




301563 AI802946 


Hs 44208 ESTs' Weakly similar to match to ESTs AA 




301656 AW008475 


Hs 151258 EST cluster (not in UniGene) with exon h 




301689 Z44810 






301783 AL046347 


Hs 83937 Homo sapiens PAC clone DJ1 159004 from 7p 




301805 AI800004 






301 B46 R20002 






301891 AF131855 






302005 AI869666 


Hs.123119 ESTs 




302056 AI457532 






302067 H05698 






302099 AL021397 












302214 AJ001454 






302236 AI128606 






302358 D8115Q 


Hs.322848 EST cluster (not in UniGone) with exon n 






Hs 218366 EST cluster (not in UniGene) with exon h 


26 8 


30248S AC003682 


Hs.1 83512 multiple UniGene matches 




302582 NM_000522 




54 


302785 AA425562 






302792 AA343696 


Hs.46821 ESTs; Weakly similar to putative [n.sapi 




302881 AA508353 


ns.iuooi4 reiaxin i imj 


788 


302892 N58545 


Hs.42346 histone deacetylase 3 




302970 AW1 18352 


Hs 312679 EST cluster (not in UniGene) with exon h 


74 


302977 AVV263124 






303029 AF199613 






303125 AF161352 


Hs.1 1 1782 EST cluster (not in UniGene) with exon n 




303280 AI571580 




43 


303306 AA215297 






303309 AL1 34164 


Hs.145416 ESTs 




303344 AA255977 


Hs.250646 ESTs* Highly similar to ubiquitin-conjug 


19 5 


303380 AA298471 






303401 AA758552 






303525 AW516519 


Hs.273294 ESTs 




303526 AA348111 


Hs.96900 ESTs 


12 1 


303540 AA355607 


Hs.309490 ESTs; Weakly similar to MMSET type I [H. 




303572 AW338520 


Hs.242540 ESTs 




303685 AW500106 


Hs.23643 EST cluster (not in UniGene) with exon h 




303699 D30891 Hs.19525 EST cluster (not in UniGene) with exon h 15.7
303702 AW500748 Hs.224961 ESTs; Weakly similar to 73 kDA subunit o 6.3
303718 AI741397 Hs.114658 ESTs 4.6
303722 AA521510 Hs.145010 ESTs 12.5
303732 AW502405 Hs.125759 ESTs; Weakly similar to tumor suppressor 4.3
303735 AA707750 Hs.169055 ESTs; Weakly similar to cis-Golgi matrix 5.4
303752 AI017286 Hs.5957 EST cluster (not in UniGene) with exon h 5.3
303753 AW503733 Hs.9414 ESTs 13
303813 AI275850 Hs.114658 EST cluster (not in UniGene) with exon h 7.8
304053 R00493 Hs.125565 translocase of inner mitochondrial membr 4.8
304218 N66373 Hs.27973 ESTs; Weakly similar to ZK354.7 [C.elega 6
305200 AA668128 Hs.45207 EST singleton (not in UniGene) with exon 5.7
306716 AI024916 Hs.251354 ESTs 5.7
307846 AI364186 EST singleton (not in UniGene) with exon 7.3
307871 AI368665 Hs.31476 EST singleton (not in UniGene) with exon 5.4
308050 AI460004 Hs.31608 EST singleton (not in UniGene) with exon 8.1
308362 AI613519 Hs.105749 EST singleton (not in UniGene) with exon 5.5
308923 AI863051 Hs.279815 ESTs 4.4
309116 AI927149 Hs.29797 ribosomal protein L10 4.5
309375 AW075342 Hs.9271 EST singleton (not in UniGene) with exon 7.4
309674 AW205604 Hs.266009 ESTs; Weakly similar to !!!! ALU SUBFAMI 5
310095 AI921750 Hs.144871 ESTs 5
310098 AI685841 Hs.161354 ESTs 11.6
310250 AI478629 Hs.158465 ESTs 5.8
310365 AI262148 Hs.145569 ESTs 9.7
310382 AI734009 Hs.127699 EST cluster (not in UniGene) 10.4
310409 AI612775 Hs.145710 ESTs 4.6
310431 AI420227 Hs.149358 ESTs 72.9
310573 AW292180 Hs.156142 ESTs 7.6
310598 AI338013 Hs.140546 ESTs 9.2
310539 AW269082 Hs.175162 ESTs 4.5
310787 AW262580 Hs.147674 ESTs 4.9
310816 AI973051 Hs.224965 ESTs 7.6
311251 AI655662 Hs.197698 ESTs 41.3
311280 AI767957 Hs.198248 ESTs; Weakly similar to Y38A8.1 gene pro 4.5
311330 AI679524 Hs.201629 ESTs; Moderately similar to !!!! ALU SUB 4.6
311515 AW136713 Hs.23862 ESTs 5.9
311574 AI824863 Hs.211420 ESTs 4.8
311587 AI828254 Hs.271019 ESTs 5.8
311596 AI682088 Hs.79375 ESTs 26.4
311631 AI809519 Hs.27133 ESTs 6.4
311688 AW025661 Hs.240090 ESTs 7.4
311783 AI682478 Hs.13528 EST 4.6
311826 AA765470 Hs.85092 ESTs 6.7
311853 AW014013 Hs.107056 ESTs 5.3
311901 R16890 Hs.137135 ESTs 5.6
311932 AW451654 Hs.257482 ESTs 4.3
312153 AA759250 Hs.118625 cytochrome b-561 11
312182 AA834800 Hs.326263 EST cluster (not in UniGene) 15.9
312242 AI380207 Hs.125276 ESTs 4.7
312295 C01367 Hs.127128 ESTs 5.3
312407 R46180 Hs.153485 ESTs 6.2
312424 AA847398 Hs.291997 ESTs 4.8
312425 R49353 Hs.293892 ESTs 5.2
312480 R68651 Hs.144997 ESTs 9.5
312518 C17785 Hs.182738 ESTs 6.3
312521 AA033609 Hs.239884 ESTs 11.2
312527 AI695522 Hs.191271 ESTs 4.7
312539 AI004377 Hs.200360 ESTs 7
312546 AI623511 Hs.118567 ESTs 5.1
312563 AA976064 Hs.180842 ESTs 6.5
312623 AA694607 Hs.176956 EST cluster (not in UniGene) 10.8
312857 AA772279 Hs.125914 ESTs 5
312890 AI813554 Hs.5957 ESTs 5.8
312903 AA939266 Hs.278626 ESTs 7.7
312905 H92571 Hs.234478 ESTs 6.5
312976 AA836271 Hs.125830 ESTs 4.6
312983 AI079278 Hs.269899 ESTs 5.1
312996 AA249018 Hs.154331 EST cluster (not in UniGene) 7
313035 N36417 Hs.144928 ESTs 6.3
313166 AI801098 Hs.151500 ESTs 4.3
313188 AI039702 Hs.179573 collagen; type I; alpha 2 4.8
313218 AA827805 Hs.124296 ESTs 5
313226 AI200281 Hs.123910 ESTs 5.9
313325 AI420611 Hs.127832 ESTs 4.6
313326 AI088120 Hs.122329 ESTs 7.4
313425 AA745689 Hs.186838 ESTs; Weakly similar to similar to zinc 6.3
313499 AI261390 Hs.146085 ESTs 5.6
313540 AI797301 Hs.5740 ESTs 5.9
313568 AW467376 Hs.129640 ESTs 4.3
313569 AI273419 Hs.135146 ESTs; Weakly similar to ZK1058.5 [C.eleg 4.6
313603 AW468119 Hs.287631 EST cluster (not in UniGene) 6.8
313615 AW295194


Hs 301997 DKFZP434N126 protein 




313625 AW468402 


Hs 254020 ESTs 




313634 AA688292 


Hs 337786 ESTs 






Hs 6390 ESTs 




313638 Al/530/5 


Hs.104627 ESTs 




313670 C16690 






313671 W49823 








Hs 120591 EST cluster {not in UniGene) 


134 


313703 Al 16 1293 


Hs 280380 ESTs" Weakly similar to KIAA0525 protein 




313712 AA768553 






313800 AW296132 


Hs 55098 ESTs 




313979 AI535895 


Hs 221024 ESTs 










314123 AW245993 


Hs. 223394 ESTs 




314171 AI821895 


Hs .193481 ESTs 


294 


314188 AL138431 


Hs 164243 ESTs 




314219 AL036001 


Hs 48376 ESTs 






Hs 189023 ESTs 




314237 AA732359 


Hs.86264 ESTs 




314284 AA731431 


Hs.293464 EST cluster (not in UniGene) 






Hs»125232 ESTs 


53 


314343 AI754701 


He 10175ft PQTc' ^ 




314530 AI052358 


nS.l93/£0 cols 




314691 AW207206 






314695 AW5Q2698 


Hs 118152 ESTs 




314785 AI538226 


Hs. 32976 ESTs 






Hs 109045 ESTs" Weakty similar to ORF YGR245c [S c 


fl 


314864 AA493811 


Hs.294068 ESTs 




314907 AI672225 






314916 AA548906 


Hs.1 22244 ESTs 




314954 AA521381 






314981 AA524953 


Hs.293334 ESTs 




315021 AA533447 


Hs.31 2989 EST cluster (not in UniGene) 




315051 AW292425 


Hs.1 63484 EST 




315052 AAS76910 


Hs.134427 ESTs 




315073 AW452948 






315084 AI821085 






315214 AI915927 


Hs.34771 ESTs 




315220 A1420753 


Hs.66731 ESTs 




315278 A1985544 


Hs.1 2450 ESTs 




315282 AI222165 


Hs,144923 ESTs 






Hs 104696 ESTs 


g 


315369 AA764918 


Hs 256531 ESTs 




315378 AI263393 


Hs.1 45008 ESTs 




315379 AJ378329 


Hs.t26629 ESTs 




315402 AW293424 


Hs.75354 ESTs 




315442 AA977935 




66 


315443 AW003416 


Hs 160604 ESTs 




315528 R37257 


Hs.184780 ESTs 




315593 AW198103 


Hs 158154 ESTs 


99 


315634 AA837085 


Hs.220585 ESTs 




315705 AW449285 






315707 AI418055 






315714 AA744015 


Hs.298l38 EST ciuster (not in UniGene) 




315740 T05558 


Hs.156880 EST cluster (not in UniGene) 




315762 A1391470 


Hs.158618 ESTs 




315769 AA744875 Hs.189413 ESTs 5
315843 AA679430 Hs.191897 ESTs 5.7
315990 AI600041 Hs.190555 ESTs 9.2
316012 AA764950 Hs.119898 ESTs 4.3
316036 AA708016 Hs.190389 ESTs 5.9
316055 AA693880 Hs.6947 EST cluster (not in UniGene) 6.7
316074 AW517542 Hs.293273 ESTs 5.5
316100 AW203986 Hs.213003 ESTs 5.1
316169 AI127483 Hs.120451 ESTs &2
316442 AA760894 Hs.153023 ESTs 17.1
316491 AA766025 Hs.186854 EST 4.6
316504 AW135854 Hs.132458 ESTs 4.3
316667 AW015940 Hs.232234 ESTs 7.6
316854 AA831215 Hs.159066 ESTs; Weakly similar to predicted using 5.1
316905 AW138241 Hs.210846 ESTs 6.4
317008 AW051597 Hs.143707 ESTs 4.4
317019 AA864968 Hs.127699 ESTs 11
317194 AW445167 Hs.126036 ESTs 13.5
317224 D56760 Hs.93029 ESTs 8.7
317404 AI806867 Hs.126594 ESTs 8.7
317501 AA931245 Hs.137097 ESTs 11.1
317548 AI654187 Hs.195704 ESTs 14.2
317651 AW292779 Hs.169799 ESTs 5.8
317758 AI733277 Hs.128321 ESTs 5.4
317850 N29974 Hs.152982 EST cluster (not in UniGene) 11.4
317869 AW295184 Hs.129142 ESTs; Weakly similar to DEOXYRIBONUCLEAS 13.8
317902 AI828602 Hs.211265 ESTs
317916 AI565071 Hs.159983 ESTs
318239 AI085198 Hs.164226 ESTs 13.1
318268 AI817736 Hs.182490 ESTs
318327 AW294013 Hs.200942 ESTs
318363 R45530 Hs.1440 gamma-aminobutyric acid (GABA) A recepto 3
318428 AI949409 Hs.194591 ESTs
318464 AI151010 Hs.157774 ESTs
318524 AW291511 Hs.159066 ESTs
318540 T30280 Hs.274803 EST cluster (not in UniGene)
318591 AW206806 Hs.115325 ESTs 4.8
318615 AI133617 Hs.10177 ESTs 5.5
318646 AW175665 Hs.278695 ESTs 5.7
3186B7 AI493742 Hs.165210 ESTs 11
318668 W26276 Hs.136075 ESTs 5.9
Hs.7130 copine IV 5.5
319080 Z45131 Hs.23023 ESTs 16.9
319181 F06504 Hs.27384 EST cluster (not in UniGene) 4.6
319191 AF071538 Hs.79414 prostate epithelium-specific Ets transcr 5.6
319233 R21054 Hs.180532 ESTs 4.9
319586 D78808 Hs.283683 ESTs 8.2
319750 AA621605 Hs.117956 ESTs 9.3
319763 AA460775 Hs.6295 ESTs 14.3
319824 AA424266 Hs.123642 EST cluster (not in UniGene) 12.8
319838 AA337642 Hs.95262 nuclear factor related to kappa B bindin 5.1
319913 AA179304 Hs.271586 ESTs; Moderately similar to !!!! ALU SUB 4.3
319964 T80579 Hs.290270 ESTs 5.8
320076 AI653733 Hs.271593 ESTs 8.5
320102 AW296219 Hs.115325 RAB7; member RAS oncogene family-like 1 9.8
320187 T99949 Hs.303428 EST cluster (not in UniGene) 9.8
320211 AL039402 Hs.125783 DEME-6 protein 7.9
320324 AF071202 Hs.139336 ATP-binding cassette; sub-family C (CFTR 562
Hs.24144 EST cluster (not in UniGene) 8.3
Hs.237146 ESTs 5.4
320561 NM_006953 Hs.159330 EST cluster (not in UniGene) 7
320574 AL049443 Hs.161283 Homo sapiens mRNA; cDNA DKFZp586N2020 (f 4.4
320576 AL049977 Hs.162209 Homo sapiens mRNA; cDNA DKFZp564C122 (fr 6.7
320654 AW263088 Hs.118112 ESTs 6
Hs.31218 secretory carrier membrane protein 1 13.5
Hs.71721 ESTs 62
320813 AW360847 Hs.16578 ESTs 9.3
320853 AI473796 Hs.135904 ESTs 8.1
320856 D59945 Hs.65366 EST cluster (not in UniGene) 6
320899 AA633772 Hs.116798 ESTs 92
320918 AW195012 Hs.293970 ESTs 5
320973 H19732 Hs.247917 ESTs 5.9
321099 AA018386 Hs.64341 ESTs 4.6
321190 H52462 Hs.163872 EST cluster (not in UniGene) 5.8
321318 AB033041 Hs.137507 EST cluster (not in UniGene) 84
321382 AW372449 Hs.175982 EST cluster (not in UniGene) 7.3
321441 AW297633 Hs.118498 ESTs 14.7
321538 H80483 Hs.46903 EST cluster (not in UniGene) 9.2
321609 H86021 Hs.182538 ESTs; Weakly similar to hMmTRA1b [H.sapi 4.8
321636 AI791838 Hs.193465 ESTs 5.5
321638 AI356352 Hs.108932 ESTs 4.6
321644 AI204177 Hs.237396 ESTs 6.6






321681 AA233821 Hs.190173 EST cluster (not in UniGene) 4.6
321726 X91221 Hs.144465 EST cluster (not in UniGene) 5
321758 U29112 Hs.196151 EST cluster (not in UniGene) 6.2
321877 AL109784 Hs.189222 EST cluster (not in UniGene) 4.6
321899 N55158 Hs.29468 ESTs 4.6
321902 AA746374 Hs.145010 ESTs 85
322007 AW410646 Hs.164649 ESTs 5.1
322055 AL137646 Hs.146001 EST cluster (not in UniGene) 4.3
322092 AF085833 Hs.135624 EST cluster (not in UniGene) 4.3
322221 AI890619 Hs.179662 nucleosome assembly protein 1-like 1 4.4
322278 AF086283 EST cluster (not in UniGene) 5.8
322303 W07459 Hs.157601 EST cluster (not in UniGene) 22
322437 AW393804 Hs.170253 ESTs; Weakly similar to rabaptin-4 [H.sa 4.4
322493 AF143235 Hs.279819 EST cluster (not in UniGene) 7.2
322782 AA056060 Hs.202577 EST cluster (not in UniGene) 18.4
322811 AA782292 Hs.105872 ESTs 6.9
322818 AW043782 Hs.293616 ESTs 10.7
322826 AI807883 Hs.180059 ESTs 5
322887 AI986306 Hs.86149 ESTs; Weakly similar to KIAA0969 protein 11.9
322889 AA081924 Hs.124918 ESTs 7.1
322924 AA669253 Hs.136075 ESTs 4.5
322982 AI351191 Hs.128430 ESTs 6.6
322994 AA422116 Hs.191461 ESTs 4.7
323040 AA336609 Hs.10862 ESTs 6.9
323041 AL118747 Hs.26691 EST cluster (not in UniGene) 8.3
323045 AA148950 Hs.188836 ESTs 4.6
323048 AL118923 Hs.175110 EST cluster (not in UniGene) 7.5
323070 AA157726 Hs.264330 ESTs 7.5
323071 AA157867 Hs.5722 ESTs 4.7
323097 Z44354 Hs.296261 guanine nucleotide binding protein (G pr 4.9
323131 AA176982 Hs.270124 EST cluster (not in UniGene) 6.1
323136 AL120351 Hs.30177 EST cluster (not in UniGene) 4.3
323175 AI827137 Hs.336454 ESTs 6.2
323218 AF131846 Hs.13396 Homo sapiens clone 25028 mRNA sequence 6.3
323226 AF055019 Hs.21906 Homo sapiens clone 24670 mRNA sequence 12.6
323236 AA363148 Hs.293960 ESTs 10.9
323262 AI829770 Hs.190642 ESTs 7.6
323276 AA836452 Hs.323822 ESTs 7.6
323287 AA639902 Hs.104215 ESTs 24.7
323335 AI655499 Hs.161712 ESTs 14.1
323341 AL134875 Hs.108646 ESTs 5.3
323362 AL135067 Hs.117182 ESTs 6.1
323486 C05278 Hs.299221 ESTs; Moderately similar to [PYRUVATE DE 8.5
323496 AI826801 Hs.300700 ESTs 45
323507 H71721 Hs.128387 ESTs 4.4
323545 AI814405 Hs.224569 ESTs 5.8
323623 AA314280 Hs.146589 EST cluster (not in UniGene) 5
323663 AW263526 Hs.243023 ESTs 7.7
323691 AA317561 Hs.145599 EST cluster (not in UniGene) 5.9
323810 AA740405 Hs.108806 ESTs 6.2
323846 AA337621 Hs.137635 ESTs 6
323929 AA354940 Hs.145958 ESTs 10.7
323959 AI636775 Hs.6831 ESTs 5.4
323996 AA367032 Hs.217882 ESTs 5.8
323997 AA844907 Hs.274454 EST cluster (not in UniGene) 4.4
324019 AW177009 EST cluster (not in UniGene) 4.6
324130 AL046575 Hs.130198 ESTs 11
324295 AI146686 Hs.143691 ESTs 13.7
324296 AI524039 Hs.192524 ESTs 6.8
324307 AA627642 Hs.4994 transducer of ERBB2; 2 (TOB2) 4.9
324330 AA884766 EST cluster (not in UniGene) 4.3
324385 F28212 Hs.284247 EST cluster (not in UniGene) 4.7
324430 AA464018 Hs.184598 EST cluster (not in UniGene) 13.6
324452 AW014022 Hs.170953 ESTs 7.6
324547 AW501974 Hs.74170 ESTs 5.6
324603 AW016378 Hs.292934 ESTs 24.2
324617 AA508552 Hs.195839 ESTs 54
324618 AI346282 Hs.87159 ESTs 4.6
324620 AA448021 Hs.94109 EST cluster (not in UniGene) 5.7
324691 AK17963

324696 AA641092 

324713 AW340249 

324715 AI739168 

324718 AI557019 

324720 AA578904 

324752 AI279919 

324753 AA612626 
324790 AI334367 



324845 AA361016 

324888 AI564134 

324929 A1741633 

324961 AA613792 

325108 AA401863 



330551 U39840 

330658 AA319514 

330700 AA037415 

330704 AA056557 

330705 AA102571 

330706 AA121140 
330712 AA167269 



Hs.1 29179 ESTs 
Hs.1 12451 ESTs 

Hs.293341 ESTs; Weakly similar to Pio-a2(XI) [H.sa 
Hs.257339 ESTs 
Hs.1 63440 ESTs 

Hs.131798 EST cluster (not in UniGene) 
Hs.1 16467 ESTs 
Hs.292437 ESTs 

Hs.272072 ESTs; Moderately similar to !!!! ALU SUB 
Hs.1 44871 EST cluster (not in UniGene) 
Hs.1 59337 ESTs 
Hs.1 4553 ESTs 
ESTs 



2 KIAA0853 protein 

0 ESTs 

EST cluster (not in UniGene) 
ESTs 

CH20_hsgi|6552458 

CH.21_hsgi|5867660 

CH.21_hsgi|6682516 

CH.07_hsgi|5868455 

CH.X_hsgi|5868837 

CH.16_p2gi|6165201 

CH.16_p2gi|5091594 

CH.16_p2gi|6671887 

CH.05_p2gi|6013592 

androgen receptor (dihydrotestosterone r 

Hs.321110 

7 guanine nucleotide binding protein 4 
hepatocyte nuclear factor 3; alpha 
ESTs 

ESTs 
ESTs 

8 ESTs 

6 ESTs; Moderately similar to kynurenine a 

1 ESTs 

ESTs; Weakly similarto !!!! ALU SUBFAMI 



in, Prostate Specific, Alt. Splice 



60 



330772 AA479114 

330786 D60374 

330892 AA149579 

330949 H01458 

330977 H20826 

331017 N24619 

331099 R36671 

331128 R51361 

331151 R82331 

331195 T64447 

331320 AA262999 

331321 AA278355 
331337 AA287662 
331348 AA400596 
331359 AA416979 
331383 AA454543 
331422 F10802 
331442 H77381 
331466 N21680 
331479 N27154 
331490 N32912 
331493 N34357 
331561 N62780 



Hs.315181 
Hs.108920 
Hs.14846 
Hs.268714 



Hs.1 68439 
Hs.300141 
Hs.87929 
Hs.1 18630 
Hs.88143 
Hs.81897 
Hs.43543 
Hs.237339 
Hs.41223 
Hs.43455 
Hs.44076 
Hs.291039 
Hs.93817 
Hs.48703 
Hs.5472 



EST 
ESTs 
ESTs 
ESTs 
ESTs 
ESTs 
ESTs 
ESTs 
ESTs 
ESTs 
ESTs 
ESTs 
ESTs 
ESTs 
ESTs 

ESTs; Moderately similar to !!!! ALU SU 

ESTs 

ESTs 

ESTs 

ESTs; Weakly similar to hypothetical 43 



331811 AA404500 
AA417039


Hs.98268 signal recognition particle 72kD 


7.5 




AA429445 


Hs.98640 ESTs 


6.5 


331889 


AA431407 


Hs.98802 Homo sapiens Chromosome 1 6 BAC clone CIT 


33.6 


331967 


AA460158 


Hs.99589 KIAA1028 protein 


6.8 




AA464518 


Hs.1 05322 ESTs 


5.3 


332043 


AA490831 


H&201591 ESTs 


10.6 




AA599477 


Hs.291156 ESTs 


4.4 


332173 


F09281 


Hs.100725 ESTs 


5.5 






ESTs 


14.2 




N62096 


Hs.1 94140 ESTs 


13. 






Hs.339667 ESTs 


5.6 


332396 


AA340504 


ESTs; Weakly similar to similarto human 


212 


332434 


N75542 


Hs.237731 transcription factor 4 


15.3 


332493 


N95495 


Hs.56729 ESTs; Highly similar to GTP-binding prat 


7.1 


332522 


L38503 


Hs.178357 glutathione S-transferase tfieta 2 


6.6 




AA281753 


Hs.17731 inositol 1;4;5-triphosphate receptor; ty 


5.8 




M31682 


Hs.1 9280 inhibin; beta B (activin AB beta polypep 


5.5 






Hs.325825 folate hydrolase (prostate-specific memb 


38.1 






Hs.20991 ESTs 








Hs.22587 solute carrier family 35 (UDP-galactose 


4.3 


332594 


AA279313 


Hs.32951 methyl CpG binding protein 2 


5.6 




AA412405 


Hs.40513 ESTs; Weakly similar to BETA GALACTOSIDA 


5.6 




N95742 


Hs.6390 ESTs 


6.9 






Hs.75725 carboxypeptidase E 


24.3 




D26070 


Hs.79306 inositol 1;4;5-triphosphate receptor; ty 


9.9 




















AA233258 


ESTs; Weakly similar to D1007.5 [C.etega 




332797 




CH22J=GENES.6_2 


30.8 


332798 




CH22 FGENES.6 5 


66.8 


332799 




CH22_FGENES.6_6 


19.8 


332933 




CH22_FGENES.38_7 


5.6 






CH22 FGENES.54 1 


5.5 






CH22~FGENES.54 6 


4.9 






CH22 FGENES.94J 


4.7 


333169 




CH22~FGENES.94 2 


4.4 






CH22_FGENES.157_1 


4.8 






CH22lFGENES.157_5 


4.3 






CH22~FGENES.157 7 


4.6 


333611 




CH22 FGENES517 6 


4.7 


333621 




CH22>GENESJ219_5 


5.5 
7.1 


333814 




CH22_FGENES.282_2 
CH22 FGENES.290_8 
CH22_FGENES.303l5 


6.2 
4.3 






CH22FGENES.303I7 


4.9 






CH22lFGENES.303_1 1 


5.6 






CH22 FGENES.339 1 




334223 




CH22~FGENES.360_4 


20.3 


334297 




CH22_FGENES.372_3 


9.4 






CH22 FGENES.387_2 


4.6 






CH22 FGENES.387 4 




TW417 




CH22FGENES.387~7 


131 






CH22~FGENES.405_1 1 
CH22~FGENES.427_1 








CH22~FGENES.430 9 








CH22 FGENES465 29 




335179 CH22_FGENES.504_9
335293 CH22_FGENES.527_6 4.7
335550 CH22_FGENES.576_11 5.1
335581 CH22_FGENES.581_19 5.7
335586 CH22_FGENES.581_25 4.3
335809 CH22_FGENES.617_6 62
335810 CH22_FGENES.617_7 5.8
335822 CH22_FGENES.619_7 7.1
335824 CH22_FGENES.619_11 8.5
335853 CH22_FGENES.626_5 4.3
335886 CH22_FGENES.632_4 4.3
336034 CH22_FGENES.678_5 6.8
336441 CH22_FGENES.827_7 7.6
336624 CH22_FGENES.6-3 43.3
336625 CH22_FGENES.6-4 37.9
336679 CH22_FGENES.43-7 5.3
337577 CH22_C65E1.GENSCAN.8-1 4.9
338255 CH22_EM:AC005500.GENSCAN.276-3 13.4
338260 CH22_EM:AC005500.GENSCAN.279-10 4.6
338561 CH22_EM:AC005500.GENSCAN.421-5 4.6
338562 CH22_EM:AC005500.GENSCAN.421-6 4.3
338759 CH22_EM:AC005500.GENSCAN.517-6 5.1
338763 CH22_EM:AC005500.GENSCAN.517-16 5.5
338764 CH22_EM:AC005500.GENSCAN.517-17 7.1
TABLE 3A shows the accession numbers for those primekeys lacking unigeneID's for Table
3. For each probeset we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number

CAT number: Gene cluster number 

Accession: Genbank accession numbers 
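
Each row of the listing that follows can be split into these three fields mechanically. The following is a minimal Python sketch, assuming the fields are separated by whitespace and that the accession list may be empty; the function name `parse_cluster_row` is hypothetical and is not part of the original disclosure.

```python
def parse_cluster_row(line):
    """Split one listing row into Pkey, CAT number, and accession list."""
    fields = line.split()
    if len(fields) < 2:
        raise ValueError("a row needs at least a Pkey and a CAT number")
    return {
        "pkey": fields[0],          # unique probeset identifier
        "cat_number": fields[1],    # gene cluster number
        "accessions": fields[2:],   # zero or more accession numbers
    }


row = parse_cluster_row("103677 41847J Z83806 AJ132091 AJ132090")
print(row["pkey"], row["cat_number"], len(row["accessions"]))
```

Rows that carry only a Pkey and a cluster name simply yield an empty accession list.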

Pkey CAT number Accession

123619 3716B1J AA602964 AA609200
116722 143512J Z24878 AA494098 F13654 AA494040 AA143127
103677 41847J Z83806 AJ132091 AJ132090
125992 1589048J H48372 W01626
109342 genbank_AA213620 AA213620
125154 genbank_W38419 W38419
101447 entrez_M21305 M21305
124357 genbank_N22401 N22401
108910 genbank_AA136590 AA136590
322278 47271J W69304 AF086283 W692CO
315084 350959J AI821085 AW973464 AA554802 AI821831 AA657438 AA640756 AA650339
324019 262792J AW177009 AI381610
324330 300543J AA884766 AW974271 AA592975 AA447312
324626 336411J AI685464 AW971336 AA513587 AA525142
303029 37699J AF199613 AF108756
324804 398093J AI692552 AI393343 AI800510 AI377711 F24263 AA661876
324961 376239J
329362 cjUis
336624 CH22_4071FG_6_3_
336625 CH22_4072FG_6_4_
336679 CH22_4157FG_43_7_
338255 CH22_6856FG_LINK_EM:AC00
338260 CH22_6863FG_LINK_EM:AC00
329929 c16_p2
338561 CH22_7294FG_LINK_EM:AC00
338562 CH22_7295FG_LINK_EM:AC00
338759 CH22_7581FG_LINK_EM:AC00
338763 CH22_7585FG_LINK_EM:AC00
338764 CH22_7586FG_LINK_EM:AC00
333168 CH22_400FG_94_1_LINK_EM:A
333169 CH22_401FG_94_2_LINK_EM:A
333452 CH22_702FG_157_1_LINK_EM:
333456 CH22_706FG_157_5_LINK_EM:
333458 CH22_708FG_157_7_LINK_EM:
333611 CH22_872FG_217_6_LINK_EM:
333621 CH22_882FG_219_5_LINK_EM:
333814 CH22_1083FG_282_2_LINK_EM
333849 CH22_1118FG_290_8_LINK_EM
335179 CH22_2515FG_504_9_LINK_EM
333949 CH22_1225FG_303_5_LINK_EM
333951 CH22_1227FG_303_7_LINK_EM
333955 CH22_1231FG_303_11_LINK_E
335293 CH22_2635FG_527_6_LINK_EM
326816 c20_hs
326997 C21_hs
335550 CH22_2905FG_576_11_LINK_E
335581 CH22_2938FG_581_19_LINK_E
335586 CH22_2944FG_581_25_LINK_E
328492 c_7_hs
335809 CH22_3181FG_617_6_LINK_EM
335810 CH22_3182FG_617_7_LINK_EM
335822 CH22_3195FG_619_7_LINK_EM
335824 CH22_3197FG_619_11_LINK_E
335853 CH22_3228FG_626_5_LINK_EM
335886 CH22_3261FG_632_4_LINK_EM
330020 c16_p2
330211 c_5_p2
337577 CH22_5864FG_LINK_C65E1.G
307848 AI364186
332797 CH22_13FG_6_2_LINK_C4G1.G
332798 CH22_14FG_6_5_LINK_C4G1.G
332799 CH22_15FG_6_6_LINK_C4G1.G
334150 CH22_1429FG_339_1_LINK_EM
332933 CH22_154FG_38_7_LINK_C20H
332980 CH22_204FGJ4J_LINK_EM:A
332984 CH22_208FG_54_6_LINK_EM:A
334223 CH22_1507FG_360_4_LINK_EM
334297 CH22_1588FG_372_3_LINK_EM
327098 C21_hs
334443 CH22_1742FG_387_2_LINK_EM
334444 CH22_1743FG_387_4_LINK_EM
334447 CH22_1746FG_387_7_LINK_EM
334570 CH22_1875FG_405_11_LINK_E
334749 CH22_2061FG_427_1_LINK_EM
334777 CH22_2089FG_430_9_LINK_EM
336034 CH22_3419FG_678_5_LINK_DJ
334960 CH22_2281FG_465_29_LINK_E
336441 CH22_3861FG_827_7_LINK_DJ

330551 9851_2 U39840 NM_004496 AW135607 BE087458 BE087567 AA177116 AW195705 AW750756 AI811008 AI694151
BE348594 AW971075 AI347950 AI201455 AI073898 AA652680 AA613671 AI318364 AA507550 AA693692
AI032599 AA991871 AI269801 AW948974 T74639 AA532907 AW949173
330786 53973_3 BE379594 AI192455 AL039862 AI744012 AI761735 AW243181 AI743687 AI928223 AI423022 AI627855
AIG3G059 AI651571 AW802044 AI826995 AI431733 AI539125 AA863056 AW270910 AI768930 AW008835
AW615183 AW591147 AI695294 AI672106 AA506358 AI308060 AA011556 AA962437 AI935488 BE219G25
AI004356 AW151394 AI218466 N66178 AI419784 AW242519 AW946907 D60374 AA989263 AI698799
AA470460 AI824167
332247 372939_1 AA669097 AA513815 AA026798 AA676526 AA704429 AA704269 AW118292 AA579216 N58172
332396 20265_1 AW579842 BE156562 BE156690 BE156489 BE081033 AK001559 BE149402 M85387 AW367811 AW367798
R17370 AI908847 AA382932 R58449 H18732 AA371231 AW962899 AA713530 AW892946 R53463 H11063
AW068542 Z40761 BE176212 BE176155 W23952 W92188 AW374883 AA303497 AW954769 AA036808
BE168063 AW382073 AW382085 AL041475 H80748 AI078161 BE463983 AI805213 AI761264 W94885
N94502 AI623772 AI419532 AI810302 AI634190 AW002516 AW150777 AI352312 AI367474 AW204807
AI675502 AI337026 AW134715 BE328451 AI123157 AI580020 AI300745 AI608631 AI248873 AA742484
AW051635 H18646 AI245045 AA507111 AI640510 AI925594 AA115747 AA143035 AA151106
332781 32044J AK001764 BE313895 AA380199 AA380151 AA194996 AW118089 AA495871 AW975219 AW085598
AI378909 AW992310 AW992409 AI911857 AA657643 AI804471 AI242589 AI623968 R09556 AI129100
AI206500 AA680094 AA677784 AI023178 AI277519 AA424742 AI240654 AA232846 AI804273 AI382376
AA001729 W90790 BE090656 AW295015 AI674596 AI431734 AI420517 AW769185 AI128355 AI192474
AI820001 AA001929 AA706925 AI076676 AI499119 AI200493 AI895919 AI376217 W69195 W69261

TABLE 3B shows the genomic positioning for those primekeys lacking unigene IDs and
accession numbers in Table 3. For each predicted exon, we have listed the genomic sequence
source used for prediction. Nucleotide locations of each predicted exon are also listed.

Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the
publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.

Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.
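
The strand and nucleotide-position fields defined above lend themselves to a simple programmatic reading. The following is a minimal Python sketch, not part of the original disclosure. It assumes that positions are 1-based and inclusive (the source does not say), and it tolerates the two numbers being listed in either order, since minus-strand rows in this listing appear both ways. The function name `parse_exon` is hypothetical.

```python
def parse_exon(strand, nt_position):
    """Turn a strand label and a 'start-end' string into an exon span."""
    first_text, second_text = nt_position.split("-")
    first, second = int(first_text), int(second_text)
    # Normalise so that start <= end regardless of the printed order.
    start, end = min(first, second), max(first, second)
    return {
        "strand": strand.strip(),
        "start": start,
        "end": end,
        # Assumption: coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }


exon = parse_exon("Minus", "216964-216798")
print(exon["start"], exon["end"], exon["length"])  # 216798 216964 167
```

Under a half-open coordinate convention the length would be one less; the inclusive convention is only an assumption here.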



Pkey Ref Strand Nt_position

333611 Dunham, I. et al. Plus 6548368-6548507
333621 Dunham, I. et al. Plus 8597414-8597560
333814 Dunham, I. et al. Plus 7894165-7894252
333849 Dunham, I. et al. Plus 8018323-8018472
333949 Dunham, I. et al. Plus 8589634-8589791
333951 Dunham, I. et al. Plus 8592501-8592637
333955 Dunham, I. et al. Plus 8597414-8597560
334150 Dunham, I. et al. Plus 10529221-10529854
334297 Dunham, I. et al. Plus 13420934-13421058
334443 Dunham, I. et al. Plus 14298981-14299056
334444 Dunham, I. et al. Plus 14306433-14306492
334447 Dunham, I. et al. Plus 14308764-14308824
334570 Dunham, I. et al. Plus 14994868-14994943
334777 Dunham, I. et al. Plus 16259586-16260166
335179 Dunham, I. et al. Plus 21634405-21634526
335581 Dunham, I. et al. Plus 24976198-24976334
335586 Dunham, I. et al. Plus 24990333-24990497
335809 Dunham, I. et al. 26310772-26310909
335810 Dunham, I. et al. Plus 26314767-26314849
335822 Dunham, I. et al. Plus 26364087-26364196
335824 Dunham, I. et al. Plus 26376860-26376942
335886 Dunham, I. et al. Plus 26934235-26934364
336034 Dunham, I. et al. Plus 29014404-29014590
336441 Dunham, I. et al. Plus 34187606-34187663
337577 Dunham, I. et al. Plus 595377-595678
338260 Dunham, I. et al. Plus 15458919-15459257
332797 Dunham, I. et al. Minus 216964-216798
332798 Dunham, I. et al. Minus 232147-231974
332799 Dunham, I. et al. Minus 232421-232307
332933 Dunham, I. et al. Minus 2035790-2035681
332980 Dunham, I. et al. Minus 5136165-5136019
332984 Dunham, I. et al. Minus 2632606-2632457
333168 Dunham, I. et al. Minus 3729896-3729788
333169 Dunham, I. et al. Minus 3730864-3730767
333452 Dunham, I. et al. Minus 5136165-5136019
333456 Dunham, I. et al. Minus 2631933-2631797
333458 Dunham, I. et al. Minus 5143942-5143806
334223 Dunham, I. et al. Minus 12734365-12734269
334749 Dunham, I. et al. 16090686-16090106
334960 Dunham, I. et al. Minus 20160968-20160795
335293 Dunham, I. et al. 22316408-22316275
335550 Dunham, I. et al. 24668714-24668658
335853 Dunham, I. et al. Minus 26614629-26614506
336624 Dunham, I. et al. Minus 227714-227577
336625 Dunham, I. et al. Minus 229124-229024
336679 Dunham, I. et al. Minus 2035790-2035681
338255 Dunham, I. et al. Minus 15242294-15242231
338561 Dunham, I. et al. Minus 22311966-22311856
338562 Dunham, I. et al. 22312594-22312465
338759 Dunham, I. et al. Minus 26582475-26582199
338763 Dunham, I. et al. Minus 26628148-26628009
338764 Dunham, I. et al. Minus 26641232-26641101
329960 5091594 Minus 1031-1162
329929 6165201 Minus 156410-156553
330020 6671887 Plus 172397-172491
326816 6552458 Plus 198354-198436
326997 5867660 Minus 71389-72147
327098 6682516 Minus 1061684-1062361
330211 6013592 Plus 59158-59215
328492 5868455 Minus 46094-46241
329362 5868837 Minus 65688-68173

TABLE 4 shows a preferred subset of the Accession numbers for genes found in Table 3
which are differentially expressed in prostate tumor tissue compared to normal prostate
tissue.

Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: Ratio of tumor to normal body tissue

Pkey ExAccn UnigeneID Unigene Title R1


100819 HG4020-HT4290 Hs.2387 Transglutaminase 10.5
102698 U75272 Hs.1867 progastricsin (pepsinogen C)
102869 X02544 Hs.572 orosomucoid 1 22.9
105370 AA236476 Hs.22791 ESTs; Weakly similar to transmembrane pr 10.3
105645 AA282138 Hs.11325 ESTs 14
106094 AA419461 Hs.23317 ESTs 10.9
109014 AA156790 Hs.262036 ESTs 15.3
109562 F01811 Hs.187931 ESTs; Moderately similar to voltage-gate 10.8
113021 T23855 Hs.129836 KIAA1028 protein 10.8
114124 Z38595 Hs.125019 ESTs; Highly similar to KIAA0886 protein 21.3
122791 AA460158 Hs.129836 KIAA1028 protein 12.4
124352 N21626 Hs.102406 ESTs 102
301042 AI659131 Hs.197733 ESTs 24.9
302005 AI869666 Hs.123119 ESTs 36.3
302410 NM_004917 Hs.218366 EST cluster (not in UniGene) with exon h 26.8
302881 AA508353 Hs.105314 relaxin 1 (H1) 78.8
303344 AA255977 Hs.250646 ESTs; Highly similar to ubiquitin-conjug 19.5
303753 AW503733 Hs.9414 ESTs 13
310431 AI420227 Hs.149358 ESTs 72.9
311251 AI655662 Hs.197698 ESTs 41.3
311596 AI682088 Hs.79375 ESTs 26.4
312153 AA759250 Hs.118625 cytochrome b-561 11
312521 AA033609 Hs.239884 ESTs 112
313676 AA861697 Hs.120591 EST cluster (not in UniGene) 13.4
314171 AI821895 Hs.193481 ESTs 29.4
314907 AI672225 Hs.222886 ESTs 19.3
315051 AW292425 Hs.163484 EST 15.5
315052 AA876910 Hs.134427 ESTs 20
317548 AI654187 Hs.195704 ESTs 142
317869 AW295184 Hs.129142 ESTs; Weakly similar to DEOXYRIBONUCLEAS 13.6
318428 AI949409 Hs.194591 ESTs 12.3
318524 AW291511 Hs.159066 ESTs 25.9
319080 Z45131 Hs.23023 ESTs 16.9
319763 AA460775 Hs.6295 ESTs 14.3
320324 AF071202 Hs.139336 ATP-binding cassette; sub-family C (CFTR 562
321441 AW297633 Hs.118498 ESTs 14.7
322303 W07459 Hs.157601 EST cluster (not in UniGene) 22
322782 AA056060 Hs.202577 EST cluster (not in UniGene) 18.4
322818 AW043782 Hs.293616 ESTs 10.7
323287 AA639902 Hs.104215 ESTs 24.7
324603 AW016378 Hs.292934 ESTs 242
324617 AA508552 Hs.195839 ESTs 54
324658 AI694767 Hs.129179 ESTs 22
324691 AI217963 Hs.293341 ESTs; Weakly similar to Pro-a2(XI) [H.sa 10.5
324698 AA641092 Hs.257339 ESTs 102
324718 AI557019 Hs.116467 ESTs 34.4
330211 CH.05_p2 gi|6013592 12.6
330430 HG2261-HT2352 Hs.321110 Antigen, Prostate Specific, Alt. Splice 13.3
330706 AA121140 Hs.177576 ESTs; Moderately similar to kynurenine a 145
330762 AA449677 Hs.15251 Human DNA sequence from clone 437M21 on 18.5
330892 AA149579 Hs.91202 ESTs 15.3
330949 H01458 Hs.142896 ESTs 10.3
331099 R36671 Hs.14846 ESTs 11.6
331151 R82331 Hs.268838 ESTs 13
331889 AA431407 Hs.98802 Homo sapiens Chromosome 16 BAC clone CIT 33.6
332247 N58172 ESTs 14.2
332396 AA340504 ESTs; Weakly similar to similar to human 21.2
332533 M99487 Hs.325825 folate hydrolase (prostate-specific memb 38.1
332697 T94885 Hs.75725 carboxypeptidase E 24.3
332797 CH22_FGENES.6_2 30.8
332798 CH22_FGENES.6_5 66.8
332799 CH22_FGENES.6_6 19.8
334223 CH22_FGENES.360_4 20.3
336624 CH22_FGENES.6-3 43.3
336625 CH22_FGENES.6-4 37.9

TABLE 4A shows the accession numbers for those primekeys lacking unigene IDs for Table
4. For each probeset we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number

CAT number: Gene cluster number

Accession: Genbank accession numbers

Pkey CAT number Accession

336624 CH22_4071FG_6_3_
336625 CH22_4072FG_6_4_
330211 C_5_p2
332797 CH22_13FG_6_2_LINK_C4G1.G
332798 CH22_14FG_6_5_LINK_C4G1.G
332799 CH22_15FG_6_6_LINK_C4G1.G
334223 CH22_1507FG_360_4_LINK_EM
332247 372969_1 AA669097 AA513815 AA026798 AA676526 AA704429 AA704269 AW118292 AA579216 N58172
332396 20265_1 AW579842 BE156562 BE156690 BE156489 BE081033 AK001559 BE149402 M85387 AW367811
AW367798 R17370 AI908947 AA382932 R58449 H18732 AA371231 AW962899 AA713530 AW892945
R53463 H11063 AW068542 Z40761 BE176212 BE176155 W23952 W92188 AW374883 AA303497
AW954769 AA036808 BE168063 AW382073 AW382085 AL041475 H80748 AI078161 BE463983
AI805213 AI761264 W94885 N94502 AI623772 AI419532 AI810302 AI634190 AW002516 AW150777
AI352312 AI367474 AW204807 AI675502 AI337026 AW134715 BE328451 AI123157 AI550020
AI300745 AI608631 AI248873 AA742484 AW051635 H18646 AI245045 AA507111 AI640510 AI925594
AA115747 AA143035 AA151106

TABLE 4B shows the genomic positioning for those primekeys lacking unigene IDs and
accession numbers in Table 4. For each predicted exon, we have listed the genomic sequence
source used for prediction. Nucleotide locations of each predicted exon are also listed.

Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The
DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.

Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position

332797 Dunham, I. et al. Minus 216964-216798
332798 Dunham, I. et al. Minus 232147-231974
332799 Dunham, I. et al. Minus 232421-232307
334223 Dunham, I. et al. Minus 12734365-12734269
336624 Dunham, I. et al. Minus 227714-227577
336625 Dunham, I. et al. Minus 229124-229024
330211 6013592 Plus 59158-59215

TABLE 5: 1170 GENES UP-REGULATED IN PROSTATE CANCER COMPARED TO
NORMAL ADULT TISSUES

Table 5 shows 1170 genes up-regulated in prostate cancer compared to normal adult tissues.
These were selected from 59680 probesets on the Affymetrix/Eos Hu03 GeneChip array such
that the ratio of "average" prostate cancer to "average" normal adult tissues was greater than
or equal to 3.44. The "average" prostate cancer level was set to the 85th percentile amongst 73
prostate cancers. The "average" normal adult tissue level was set to the 85th percentile
amongst 162 non-malignant tissues. In order to remove gene-specific background levels of
non-specific hybridization, the 7.5th percentile value amongst the 162 non-malignant tissues
was subtracted from both the numerator and the denominator before the ratio was evaluated.
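
The selection ratio described above can be illustrated with a short Python sketch. It is an illustration under stated assumptions rather than the procedure actually used: the function names are hypothetical, the percentile uses linear interpolation between ranks (the source does not name a convention), and returning NaN when the background-corrected denominator is not positive is a choice made here for safety.

```python
def percentile(values, pct):
    """Percentile with linear interpolation between ranks (assumed convention)."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("percentile() needs at least one value")
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def tumor_normal_ratio(tumor_levels, normal_levels,
                       average_pct=85.0, background_pct=7.5):
    """Background-corrected ratio of 'average' tumor to 'average' normal."""
    # Gene-specific background: a low percentile of the normal tissues.
    background = percentile(normal_levels, background_pct)
    numerator = percentile(tumor_levels, average_pct) - background
    denominator = percentile(normal_levels, average_pct) - background
    if denominator <= 0:
        return float("nan")  # assumption: treat as undefined, never selected
    return numerator / denominator


def is_selected(ratio, cutoff=3.44):
    """Apply the 'greater than or equal to 3.44' criterion."""
    return ratio >= cutoff


normal = [float(v) for v in range(101)]   # toy expression levels 0..100
tumor = [500.0] * 5
ratio = tumor_normal_ratio(tumor, normal)
print(round(ratio, 2), is_selected(ratio))
```

With these toy inputs the background is 7.5, so the ratio is (500 - 7.5) / (85 - 7.5), about 6.35, which passes the cutoff.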



Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: Ratio of tumor to normal tissue

Pkey ExAccn UnigeneID Unigene Title R1




446057 AI420227 Hs.149358 ESTs, Weakly similar to A46010 X-linked 86.42
400302 N48056 Hs.1915 folate hydrolase (prostate-specific memb
414569 AF109298 Hs.118258 prostate cancer associated protein 1
417407 AA923276 Hs.290905 ESTs, Weakly similar to protease [H.sapi
431579 AW971082 Hs.222886 ESTs, Weakly similar to TRHY_HUMAN TRICH
409361 NM_005982 Hs.54416 sine oculis homeobox (Drosophila) homolo
409731 AA125985 Hs.56145 thymosin, beta, identified in neuroblast
400298 AA032279 Hs.61635 six transmembrane epithelial antigen of 43.48
420154 AI093155 Hs.95420 JM27 protein 41.12
433466 AA508353 Hs.105314 relaxin 1 (H1) 39.88
400296 AA305627 Hs.139336 ATP-binding cassette, sub-family C (CFTR 38.42
400292 AA250737 Hs.72472 ESTs 36.00
432887 AI926047 Hs.162859 ESTs 36.48
439176 AI446444 Hs.190394 ESTs, Weakly similar to B28096 line-1 pr 36.45
430722 AW968543 Hs.203270 ESTs, Weakly similar to ALU1_HUMAN ALU S 33.50
437052 AA861697 Hs.120591 ESTs 33.02
418396 AI765805 Hs.26691 ESTs 32.68
434036 AI659131 Hs.197733 hypothetical protein MGC2849 32.44
407709 AA456135 Hs.23023 ESTs 32.10
426747 AA535210 Hs.171995 kallikrein 3, (prostate specific antigen 31.80
407168 R45175 ESTs 31.72
440260 AI972867 Hs.7130 copine IV 30.52
421513 X00949 Hs.105314 relaxin 1 (H1) 30.10
416370 N90470 Hs.203697 ESTs, Weakly similar to I38022 hypotheti 29.68
407122 H20276 Hs.31742 ESTs 29.24
400287 S39329 Hs.181350 kallikrein 2, prostatic 28.90
432244 AI669973 Hs.200574 ESTs 28.74
451939 U60456 Hs.27311 single-minded (Drosophila) homolog 2 28.74
415989 AI267700 Hs.111128 ESTs 28.34
418961 AW967646 Hs.23023 ESTs 27.34
425628 NM_004476 Hs.1915 folate hydrolase (prostate-specific memb 27.32
458509 AA654650 Hs.282906 ESTs 27.54
448290 AK002107 Hs.20843 Homo sapiens cDNA FLJ11245 fis, clone PL 27.16
428336 AA503115 Hs.183752 microseminoprotein, beta- 26.17
450095 AI682088 Hs.223368 holocarboxylase synthetase (biotin-[prop 25.60
400299 X07730 Hs.171995 kallikrein 3, (prostate specific antigen 24.91
437571 AA760894 Hs.153023 ESTs 24.74
453160 AI263307 Hs.146228 H2B histone family, member L 24.66
453096 AW294631 Hs.11325 ESTs 24.46
425075 AA506324 Hs.1852 acid phosphatase, prostate 24.53
407202 N58172 Hs.109370 ESTs 24.18
AU077324


















422805 


AA436989 


Hs,121017 


H2A histone family, membor A 


2252 






Hs 144997 










Hs.48376 


Homo sapiens clone HB-2 mRNA sequence 




413597 


AW302885 


Hs.117183 




2176 


426429 




Hs. 169849 










Hs. 188620 






432966 


AA650114 










AI820961 


Hs. 193465 


















BE568568 


HS.195704 






418819 


AA228776 


Hs 191721 


EST 


IBM 






Hs 99915 








AA296520 


Hs 89546 


selectin E (endothelial adhesion molecul 














415539 


AI733881 


Hs .72472 


ESTs 






AA397658 


Hs.60257 








AA516531 








429984 


AL050102 


Hs .227209 


hypofteUc^proteinlFU2?6'l 7 ^ 


17R? 




AI087412 


Hs 143611 


ESTs Weakly similar to 2004399 A chromos 










gb:tt88f04.x1 NCI_CGAP_Pr28 Homo sapiens 






AW023630 










AW292425 










AB037765 




KIAA1344 protein 






AB 006631 










AA502972 


Hs 183390 










Hs 241552 


KIAA0268 protein 










fatty-acid-Coenzyme A ligase, long-chain 










gbinp12d1 1 .s1 NCl_CGAP_Pr3 Homo sapiens 
















NM_014253 


Hs.23796 


odz (odd Oz/ton-m, Drosopnila) homolog 1 












1fi9« 




AI357412 


Hs. 157601 






453006 


A1362575 


Hs 167133 


EST 






AL1 33990 


Hs 190642 








AW958037 


Hs 22437 






408000 
















ESTs, Weakly similar to MUG2_HUMAN MUCIN 




430226 


BE245562 


Hs2551 




1540 




AA948033 


Hs 130853 


















AI249368 


Hs 98558 








AA460421 






















Hs.68583 


mitochondrial intermediate peptidase 




435974 


U29690 


Hs 37744 




14 7R 


444484 


AK002126 




hypothetical piotsh FU11264 








ns. 103262 


ESTs, Weakly similar to ZN91_HUMAN ZINC 






AA279490 


Hs.86366 


catmegin 






AF1 79274 




transmembrane protein with EGF-like and 


ilal 


44RBFK 












AW838616 










AA527941 




gb:nh30c04.s1 NCI_CGAP_Pr3 Homo sapiens 










guanylate cyclase 1 , soluble, alpha 3 






AF203032 


Hs.198760 


neurofilament, heavy polypeptide (200kD) 




419264 AA877104 Hs.293672 ESTs, Weakly similar to ALUB_HUMAN !!!! 13.40
416445 AL043004 Hs.300678 KIAA0135 protein 13.32
407275 AI364186 gb:qw34h07.x1 NCI_CGAP_Ut4 Homo sapiens 13.24
408369 R38438 Hs.182575 solute carrier family 15 (H+/peptide tra 13.21
446720 AI439136 Hs.140546 ESTs 13.05
434988 AI418055 Hs.161160 ESTs 13.02
448172 N75276 Hs.135904 ESTs 12.93
416182 NM_004354 Hs.79069 cyclin G2 12.94
420544 AA677677 Hs.98732 Homo sapiens Chromosome 16 BAC clone CIT 12.79
445413 AA151342 Hs.12677 CGI-147 protein 12.64
452538 AA889120 Hs.110637 homeo box A10 12.62
407819 R42185 Hs.274803 ESTs 12.60
433444 AW975324 Hs.129816 ESTs 12.60
thyroid receptor interacting protein 15




AW512260 


Hs.87767 




453930 


AA419466 


Hs.36727 


hypothetical protein FU10903 




AW576148 


Hs.148376 






AA013140 


Hs.1 15707 






AW753676 












cholinergic receptor, muscarinic 3 


443912 


R37257 


Hs.1 84780 






AI821895 


Hs.193481 






BE252922 


Hs.1231 19 


MAD (mothers against decapentaplegic, ur 




BE463857 


Hs.151258 


hypothetical protein FU21062 




AA1 47224 








AF097994 


Hs.301528 


L-kynurenine/alpha-aminoadipateaminotra 




AI1274B3 


Hs.120451 


ESTs, Weakly similar to unnamed protein 




AA9 09358 


Hs.128612 




448045 


AJ297436 


Hs.20166 


prostate stem cell antigen 




AW2Q4232 


Hs.279522 






AW770553 


Hs.293640 


sterol O-acyltransferase (acyl-CoenzymB 




AB032959 


Hs.161700 


novel C3HC4 type Zinc finger (ring tinge 






Hs.127574 






AF05S575 


Hs.297647 


calcium channel, voltage-dependent, L ty 


431173 


AW971198 


Hs.294068 




434539 


AW748078 


Hs.214410 


ESTs, Weakly similar to MUC2JHUMAN MUCIN 




AB020725 




KIAA0918 protein 


417708 


N74392 


Hs.50495 








Hs.220491 








Hs.301782 


phosphodiesterase 3B, cGMP-inhibited 


425665 


AK001050 


Hs.159066 


hypothetical protein FLJ10188 




AF030880 




solute carrier family, member 4 


428728 


NM_016625 


Hs.191381 


hypothetical protein 








gb:Human marinerl transposa.se gene, comp 




D84284 


Hs.66052 


CD38 antigen (p45) 










434485 


AI623511 


Hs.1 18567 






AW419196 


Hs.257924 


hypothetical protein FLJ13782 


452340 


NM_OO2202 




ISL1 transcription factor, LIM/homeodoma 


453628 


AW243307 


Hs.170187 


hypothetical protein 


408063 


BE0B6548 


Hs.42346 


caldneurin-binding protein calsarcin-1 


417687 


AI828596 


Hs.250691 


§i ST S 




AF151103 


Hs.1 12259 


T cell receptor gamma locus 




W68815 


Hs.301835 


Homo sapiens cDNA FLJ11346 lis, clone PL 


428819 


AL1 35623 


Hs.193914 


KIAA0575 gene product 








DEADiH (Asp-GIu-Ala-Asp/His) box polypep 


















441690 


R81733 


Hs.33106 


u STS u. .-1 • 




AW503756 


Hs.286184 


hypothetical protein dJ551 D2.5 


421991 


NMJM918 


Hs.1 10488 


KIAA0990 protein 


423698 


AA329796 


Hs.1098 


DKFZp434J1813 protein 


452039 


AI922988 


Hs.172510 








Hs.1 2501 9 


ESTs 


433927 


A1557019 


Hs.1 16467 


small nuclear protein PRAC 


445424 


AB028945 


Hs.12696 


cortadin SH3 domain-binding protein 


432240 


AI694767 


Hs.129179 


Homo sapiens cDNA FU13581 fis, clone PL 




AL043002 


Hs.128246 


ESTs, Moderately similar to unnamed prot 


452744 


AI267652 


Hs.30504 


Homo sapiens mRNA; cDNA DKFZp434E082 (fr 




NM 013427 


Hs.250830 


Rho GTPase activating protein 6 


427398 


AW390020 


Hs.20415 


chromosome 21 open reading frame 1 1 


446896 


T15767 


Hs.22452 


Homo sapiens mRNA tor KIAA1737 protein, 


421470 


R27496 


Hs.1 378 


annexinA3 


406554 
401424 








407902 


AL1 17474 


Hs.41181 


Homo sapiens mRNA; cDNA DKFZp727C191 (fr 


423545 


AP000692 


Hs.129781 
Hs.35598 


chromosome 21 open reading frame 5 
ESTs 


439024 
431548 


R96696 
AI834273 


Hs.9711 


novel protein 


409262 


AK000631 


Hs.52256 


hypothetical protein FU20624 


446271 


D82484 


Hs.100469 


ESTs 


448692 


AW013907 


Hs.224276 


methylcrotonoyl-Coenzyme A carboxylase 2 



12.20 
12.18 
12.16 
12.04 



11.87 
11.86 
11.76 
11.68 
11.60 
11.60 
11.51 
11.50 
11.38 
11.33 
11.32 
11.18 
11.16 
11.16 
11.14 
11.14 
11.12 
11.10 
11.08 
11.08 
11.04 
11.02 
11.02 



10.87 
10.85 
10.72 
10.10
10.04 
10.02 
414140 AA281279

435980 AF274S71 

421246 AW582962 

427304 AA761526 

442914 AW1 88551 

413627 BE182082 

439699 AF086534 

437718 AI927288 



410001 
424012 
441791 
448206 



411630 
421863 
454141 
418278 



424906 
415245 
442409 
404571 



AW372449 

SE622585 

AA298489 

AA401863 

AA814043 

U42349 

AI952677 

AW138413 



Hs.23317 
Hs.129142 
Hs.300961 
Hs.1 63853 
Hs.99519 



hypothetical protein FLJ14681 



Hs.57771 
Hs.137569 
Hs.175982 
Hs.3731 

Hs.22380 
Hs.88045 
Hs.71119 
Hs.108972 



Hs.153716 
Hs.27252 
Hs.129544 



CGI-47 protein 
ESTs 

hypothetical protein FU14007 
ESTs 

ESTs, Moderately similar to ALU1_HUMAN A 
ESTs 

Homo sapiens mRNA full length insert cDN 
Homo sapiens, Similar to RIKEN cDNA 2010 
hypothetical protein FLJ12969 
kallikrein 11 

tumor protein 63 kDa with strong homolog 

hypothetical protein FU21 159 

ESTs, Moderately similar to 138022 hypot 

olfactory receptor, family 51 , subfamily 

ESTs 

ESTs 

Putative prostate cancer tumor suppresso 
Homo sapiens mRNA; cDNA DKFZp434P228 (fr 
ATP-binding cassette, sub-family C (CFTR 
hypothetical protein 
matrix metalloproteinase 7 (matrilysin, 
ESTs, Weakly similar to A43932 mucin 2 p 
Homo sapiens mRNA for Hmob33 protein, 3' 
ESTs 

hypothetical protein MGC15438 



405876 
448807 
445372 
425171 
419963 
407385 
433172 
422631 
412719 
418849 
444922 
427674 
432101 
416288 
404915 
440106 



AI571940 
N3S417 
AW732240 Hs.300615 



AA317439 
AI041530 
AW511443 
N75582 



Hs.127699 
Hs.57787 
Hs.28707 
Hs.132107 
Hs.258110 
Hs.212875 

Hs.30343 
Hs.45140 
Hs.7323 
Hs.287631 
Hs.1 05069 
Hs.1 02720 



448212 
449532 
413930 
458191 



AL038450 Hs.48948 



W74653 
M86153 
AI420S11 
AI1 99738 
457498 AI73223Q 



Hs.271593 
Hs.75618 
Hs.127832 



ESTs 
ESTs 
ESTs 

interleukin 6 (interferon, beta 2) 
ESTs, Weakly similar to I38022 hypotheti 
hypothetical protein ASH1 
hypothetical protein FU1Q688 
ESTs 

Homo sapiens PIG-M mRNA for mannosyltran 
Homo sapiens cDNA FU13752 fis, clone PL 
H2B histone family, member Q 
EphA3 

gb:yp07c06.s1 Scares breast 3NbHBst Homo 

KIAA1603 protein 
ESTs 

signal sequence receptor, gamma (translo 

ESTs 

ESTs 

ESTs, Weakly similar to DYH9_HUMAN CILI 
regulator of mitotic spindle assembly 1 
ESTs 

hypothetical protein FU14084 

nuclear receptor co-repressor/HDAC3 comp 

Homo sapiens cDNA FU14269 fis, clone PL 



gb:tc87d07.x1 NCI_CGAP_CLL1 Homo sapiens 
ESTs, Moderately similar to A47582 B-cel 
RAB11A, member RAS oncogene family 
ESTs 

ESTs, Weakly similar to ALUA_HUMAN III! 
ESTs 

SAC2 (suppressor of actin mutations 2, y 
Homo sapiens cDNA: FU23603 fis, clone L 
ESTs 
NM_006207


Hs.1 70040 










Hs.1 06795 


Homo sapiens cDNA PLJ13136 lis, clone Nl 








Hs.87159 








AW262580 


Hs.1 47674 
















416857 


AA1 88775 


Hs.292453 










Hs.35406 


EST Hi hi I 'I t d 


766 




AB023198 


Hs.158135 


KIAA0981 protein 




420120 


AL049610 




transcription elongation factor A (Sll)- 






AF071202 


Hs.139336 


ATP-binding cassette, sub-family C (CFTR 




446307 


T50083 


Hs.9094 








AW20720G 






759 




AW295230 










AA447990 










AW369351 




Homo sapiens cDNA FU13090 fis clone NT 














448706 


AW291095 




interleukin 20 r ^P lor " a 'P na 






AB009284 


Hs 61152 




749 


431616 


AA508552 


Hs.195839 


ESTs, weakly similar to I3802<2 nypotneti 






AW014795 






744 






Hs.256398 


Homo sapiens mRNA; CDNA Dr\rZp434c0528 (T 




448519 


AW1 75665 


Hs.244334 


Homo sapiens prostein mRNA, complete cds 






A1632278 








419743 


AW408762 




Homo sapiens clone 24416 mHNA sequence 






BE247129 


Hs.145569 








M18667 




progastncsin (pepsinogen CJ 






AA406293 


Hs.301 622 






400294 




Hs.179809 


Homo sapiens prostein mRNA, complete cds 






AW292779 


Hs.1 69799 








AM11101 


Hs.221750 


nuclear autoantigenic sperm protein (nis 






AW974812 


Hs.291971 






418432 


M14156 


Hs.85112 


insulin-lite growth factor 1 (somatomedi 






AW1 18822 






721 




BE267154 


Hs.125752 








AJ0O0098 


Hs.94210 






425018 


BE245277 


Hs.154196 


E4F transcription factor 1 






AA1 58727 


Hs.150555 




718 


435380 


AA679001 


Hs.1 92221 








AW965215 


Hs.1 30707 


F<5T 


7 12 


408291 


AB023191 




KIAA0974 protein 




409110 


AA191493 


Hs.48778 


niban protein 




414485 




Hs.1 82625 


VAMP (vesicle-associated membrane protei 




430039 




Hs.1 53400 


ESTs, Weakly similar to ALU1 _HUMAN ALU S 






AW970602 








417153 


X57010 


IJ S Q0197 


collagen, type II, alpha 1 (primary oste 






AI768015 






707 


412953 


Z45794 


Hs.238809 






418051 


AW1 92535 




CCT 






NM_000399 






704 


446999 


AA1 51 520 


Hs.279525 


hypothetical protein MGC4485 




440529 


AW207640 




Homo sapiens cDNA: FU21718 fis, clone C 


7R4 




AI806867 


Hs. 126594 




701 


451027 


AW519204 


Hs.40808 


E 5 TS u„r win 


™ 


408432 


AW1 95262 




gb:xn67b05.x1 NCI_CGAP_CML1 Homo sapiens 




432223 


AA333283 


Hs.285336 


Homo sapiens, clone IMAGE:3460280, mRNA 




444805 


AB007899 




homolog of yeast ubiquiSn-protein ligas 






AA136569 


Hs.295940 


KIAA01 87 gene product 


6^98 


431725 


XS5724 


Hs.2839 


Nome disease (pseudoglioma) 


6.98 


449685 


AW296669 


Hs.66095 


ESTs 


6.97 


447313 


U92981 


Hs.18081 


Homo sapiens clone DT1P1B6 mRNA, CAG rep 


6.96 


424590 


AW966399 


Hs.46821 


hypothetical protein FLJ20086 


6.94 


449655 


AI021987 


Hs.59970 


ESTs 


6.92 


419563 


AA526235 


Hs.193162 


Homo sapiens cDNA FU11983 fis, clone HE 


6.90 


434163 


AW974720 


Hs.25206 


group XII secreted ptiospholipase A2 


6.89 


415809 


Z32789 


Hs.46601 


ESTs 


6.86 


425782 


U65468 


Hs.159525 


cell growth regulatory with EF-hand doma 


6.85 


417958


AA767382 


Hs.193417 


ESTs 


6.84 


427408 


AA583206 


Hs.2156 


RAR-related orphan receptor A 


6.79 


445873 


AA250970 


Hs.251946 


poly(A)-binding protein, cytoplasmic 1-1 


6.74 
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410718 


AI920783 


Hs.191435 






432363 


AA534489 




gb:nf76g11.s1 NCI_CGAP_Co3 Homo sapiens 




436521 


AW203986 


Hs.213003 


ESTs 




435604 


AA625279 


Hs.26892 


uncharacterized bone marrow protein BM04




419063 


AI479560 


Hs.98613 


Homo sapiens cDNA FLJ12292 fis, clone MA




418245 


AA088767 


Hs.83883 


transmembrane, prostate androgen induced 




420714 


BE172704 


Hs.222746 


KIAA1610 protein 




412707 


AW206373 


Hs.16443


Homo sapiens cDNA: FLJ21721 fis, clone C 




421896 


N62293 


Hs.45107 


ESTs






AI222020 


Hs.182364


CocoaCrisp 


6.66


452465 


AA610211


Hs.34244 






422763 


AA033699 


Hs.83938 


ESTs, Moderately similar to MAS2_HUMAN M 






AV653785 


Hs.300171 


ELL-RELATED RNA POLYMERASE II, ELONGATIO 




450164 


AI239923 


Hs.30098 


ESTs 




431060 


AF039307 


Hs.249171 


homeo box A11




408031 


AA081395 




Homo sapiens cDNA FLJ10366 fis, clone NT




420285 


AA258124 


Hs.293878 


ESTs, Moderately similar to ZN91_HUMAN Z 




444670 


H58373 


Hs.37494 


hypothetical protein MGC5370




444489 


AI151010 


Hs.157774 




6.60


445685 


AW779829 


Hs.263436 


gb:hn88a05.x1 NCI_CGAP_Kid11 Homo sapien




435677 


AA694142 


Hs.293726 


ESTs, Weakly similar to TSGA RAT TESTIS 




452221 


C21322 




hypothetical protein FLJ22242 




431510 


AA580082 


Hs.112264 






415874 


AF091622 


Hs.78893 


KIAA0244 protein 




418405 


AI858282 


Hs.11898 


ESTs, Highly similar to KIAA1370 protein 




452768 


AW069459 


Hs.61539 


ESTs 




401451 








6.52 


416289 


W26333 




ESTs 


6.52 


431778 


AL080276 


Hs.268562 


regulator of G-protein signalling 17 




409089 


NM_014781 


Hs.50421 


KIAA0203 gene product 




442833 


AA328153 


Hs.88201 


ESTs, Weakly similar to A Chain A, Cryst 


6.50


431992 


NM_002742 




protein kinase C, mu 


6.49


418833


AW974899 


Hs.292776 






429163 


AA884766 




gb:am20a10.s1 Soares_NFL_T_GBC_S1 Homo s 




430403 


AF039390 


Hs.241382 


tumor necrosis factor (ligand) superfami 




443058 


AW451642 


Hs.16732 


ESTs 






AA631143 


Hs.179809 


Homo sapiens prostein mRNA, complete cds 




432674 


AA641092 


Hs.257339 


ESTs, Weakly similar to I38022 hypotheti




423600 


AI633559 


Hs.29076 




(tU 


404253 
433610 


AA806822 


Hs.112547






421552 


AF026692 


Hs.105700


secreted frizzled-related protein 4 




407118 


AA156790 


Hs.262036 


ESTs, Weakly similar to Z223_HUMAN ZINC 




408608 


N79738 


Hs.136102 


KIAA0853 protein 




421452 


AI925946 


Hs.104530 


fetal hypothetical protein 


433285 


AW975944 


Hs.237396 






434926 


BE543269 


Hs.50252 


mitochondrial ribosomal protein L32 




446189 


H85224 


Hs.214013 




RAC\ 


416806 


NM_000288 


Hs.79993 


peroxisomal biogenesis factor 7 




416467 


H57585 


Hs.37467 


ESTs 




453403 


BE466639 


Hs.61779 


Homo sapiens cDNA FLJ13591 fis, clone PL




429769 


NM_004917 


Hs.218366 


kallikrein 4 (prostase, enamel matrix, p 




423642 


AW452650 


Hs.157148 


hypothetical protein MGC13204




425843 


BE313280 


Hs.159627 


death associated protein 3 




439221 


M737106 


Hs.32250 


ESTs, Moderately similar to I78885 serin




428194 


AA765603 


Hs.180877 


H3 histone, family 3B (H3.3B)


6.30








cadherin 3, type 1, P-cadherin (placenta 


6.30


439366 


AF100143 


Hs.6540 


fibroblast growth factor 13 


6.30 


452789 


AW081626 


Hs.242561 


ESTs 


6.30 


416836 


D54745 


Hs.80247 


cholecystokinin 


6.30 


436962 


AW377314 


Hs.5364 


DKFZP564I052 protein 


6.29 


433383 


AF034837 


Hs.192731 


double-stranded RNA specific adenosine d 


6.29 


418636 


AW749855 




gb:QV4-BT0534-281299-053-c05 BT0534 Homo 


6.26 


450728 


AW162923 


Hs.25363 


presenilin 2 (Alzheimer disease 4) 


6.25 


440293 


AI004193 


Hs.22123 


ESTs 


6.24 


453745 


AA952989 


Hs.63908 


hypothetical protein MGC14726 


6.24 


426595 


AW971980 


Hs.62402 


p21/Cdc42/Rac1-activated kinase 1 (yeast 


6.24 


444412 


AI147652 


Hs.216381 


Homo sapiens clone HH409 unknown mRNA 


6.24


413384 


NM_000401


Hs.75334 


exostoses (multiple) 2 


6.22






426320 


W47595 


Hs.169300 


transforming growth factor, beta 2 


6.22 


423349 


AF010258 


Hs.127428 


homeo box A9 


6.20 


429165 


AW009866 


Hs.118258 


prostate cancer associated protein 1 


6.18 


424800 


AL035588 


Hs.153203 


MyoD family inhibitor 


6.18 


409564


M045B57 


Hs.54943 


fracture callus 1 (rat) homolog 


6.16 




W67821 


Hs.109590 


genethonin 1 


6.16 




AF242769 


Hs.157461 


mesenchymal stem cell protein DSC54 


6.14 


451663 


AI872360 


Hs.209293 


ESTs 


6.14 


413623 


AA825721


Hs.246973 


ESTs 


6.12 


452232 


AW020603 


Hs.271698 
Hs.28482 


radial spoke protein 3 
ESTs 


6.12 
6.12 


453390 
435542 


AA862496 
AA687376 


Hs.269533 


ESTs 


6.12 




AB033036 


Hs.97594 


KIAA1210 protein 


6.11 




AA424881 


Hs.256301 


hypothetical protein MGC13170 


6.10 


409734 


BE161664 


Hs.56155 


hypothetical protein 


6.10 




BE223007 


Hs.152460 


Homo sapiens cDNA FLJ12909 fis, clone NT


6.10 




AA805666


Hs.146217 


Homo sapiens cDNA: FLJ23077 fis, clone L


6.10 


411479


AW848047 




gb:IL3-CT0214-291299-052-A12 CT0214 Homo


6.10 


438849 


W28948 


Hs.10762 


ESTs 


6.08 


452726 


AF188527 


Hs.61661 


ESTs, Weakly similar to AF174605 1 F-box 


6.08 


445895 


D29954 


Hs.13421 


KIAA0056 protein 


6.08 




AI420611 


Hs.127832 


ESTs 


6.07 




AA410506 


Hs.118578


KIAA0874 protein 


6.06 




AW970017 


Hs.293943 


ESTs, Weakly similar to S65657 alpha-1C- 


6.04 


443646 


AI085198 


Hs.298699 


ESTs 


6.04 


410566 


AA373210 


Hs.43047 


Homo sapiens cDNA FLJ13585 fis, clone PL


6.02 




AL117461 


Hs.82719 


Homo sapiens mRNA; cDNA DKFZp586F1822 (f 


6.02 






Hs.125522 


ESTs 


6.02 


434792 


AA649253 


Hs.132458 


ESTs 


6.01 




AW965078 


Hs.30212 


thyroid receptor interacting protein 15 


6.01 




AI831190 


Hs.166676 


ESTs 


6.00 




AI370876 


Hs.123163 


exportin 1 (CRM1, yeast, homolog) 


6.00 




NM_012093


Hs.18268 


adenylate kinase 5 


6.00 


447713 


AI420733 


Hs.207083 


ESTs 


6.00 




AI758905 


Hs.206063 


ESTs 


6.00 


451640 


AA195601


Hs.26771 


Human DNA sequence from clone 747H23 on 


6.00 




X91662 


Hs.66744 


twist (Drosophila) homolog (acrocephalos 


5.97 


441222 


AI277237 


Hs.44208 


hypothetical protein FLJ23153


5.96 




AI758398 


Hs.161318 


ESTs 


5.96 




AA767537 


Hs.197096 


ESTs 


5.95




NM_006042


Hs.48384 


heparan sulfate (glucosamine) 3-O-sulfot 


5.94 


453911 


AW503857 


Hs.4007 


Sarcolemmal-associated protein 


5.94 




AA114016


Hs.75746 


aldehyde dehydrogenase 1 family, member 


5.93 




NM_015434


Hs.48604 


DKFZP434B168 protein 


5.92 




AF086270 


Hs.276554 


heterochromatin-like protein 1 


5.92 




AB011537 


Hs.133466 


slit (Drosophila) homolog 1 


5.91 




AW294092 


Hs.21594 


hypothetical protein MGC15754 


5.91 


420407 


AA814732 


Hs.145010 


lipopolysaccaride-specific response 5-li 


5.91 


450480 


X82125 


Hs.25040 


zinc finger protein 239 


5.90 




AW450669 


Hs.45068 


hypothetical protein DKFZp434H43 


5.88 




NM_003478 


Hs.101299 




5.88 




AF216751 


Hs.26813 


CDA14 


5.83 




AK000028 


Hs.250867 


ribosomal protein S24 


5.86 




AA355288 


Hs.271408 


transitional epithelia response protein 


5.85 








ESTs 


5.84 




AW964440




DC32 


5.84 


452270 


AW975014 


Hs56 


ferrochelatase (protoporphyria)


5.83 


435867 


AA954229 


Hs.114052


ESTs 


5.82 


417683 


AW566008 


Hs.239154 


ankyrin repeat, family A (RFXANK-like), 


5.82 


432005 


AA524190 


Hs.120777 


ESTs, Weakly similar to ELL2_HUMAN RNA P 


5.81 


406815 


AA833930 


Hs.288036 




5.80


437980 


R50393 


Hs.278436 


WMl'4^p n rotem PyrOPhOSphat9trariSferaS 


5.80 


425856 


AA364908 


Hs.98927


hypothetical protein FLJ13993


5.79 


400301 


X03635 


Hs.1657 


estrogen receptor 1 


5.78 


446261 


AA313893 


Hs.13399 


hypothetical protein FLJ12615 similar to 


5.78 


410141 


R07775 


Hs.287657 


Homo sapiens cDNA: FLJ21291 fis, clone C


5.77 


427258 


AA400091 


Hs.39421 


ESTs 


5.76 


419108 


AA389724 


Hs.191264 


ESTs, Weakly similar to ALU7_HUMAN ALU S 


5.76 


442029 


AW956698 


Hs.14456 


neural precursor cell expressed, develop 


5.76 






407783 


AW996872 


Hs.172028


a disintegrin and metalloproteinase doma 


5.75 


434408 


AI031771 


Hs.132586 


ESTs 


5.74 


415077 


L41607 




glucosaminyl (N-acetyl) transferase 2, 1 


5.74 


432435 


BE218886 


Hs.282070 


ESTs 


5.74 


433313 


W20128 


Hs.296039 


ESTs 


5.73 


431740 


N75450 


Hs.183412 


ESTs, Moderately similar to AF116721 67


5.73 


412991 


AW949013 




gb:QV4-FT0005-110500-201-e12 FT0005 Homo


5.72 


418852 


BE537037 


Hs.273294 


hypothetical protein FLJ20069


5.72 


418882 


NM_004996


Hs.89433 


ATP-binding cassette, sub-family C (CFTR 


5.72 


446867 


AB007891 


Hs.16349 


KIAA0431 protein 


5.72 


437866 


AA156781


Hs.83992 


metallothionein 1E (functional)


5.72 


410232 


AW372451 


Hs.61184 


CGI-79 protein 


5.70 


414452 


M454038 


Hs.29032 


ESTs 


5.70 




AL031320 


Hs.119976


Human DNA sequence from clone RP1-20N2 o 


5.70 


428730 


AA625947 


Hs.25750 


ESTs 


5.70 




AW500486 


Hs.180610 


splicing factor proline/glutamine rich (


5.70 




AF038564 


Hs.98074 


itchy (mouse homolog) E3 ubiquitin prote 


5.70 


450616 


AL133067 


Hs.25214 


hypothetical protein 


5.70 


443774 


AL117428


Hs.9740 


DKFZP434A236 protein 


5.69 


446100 


AW967109 


Hs.13804 


hypothetical protein dJ462023.2 


5.69 


419168 


AI336132 


Hs.33718 


Homo sapiens cDNA FLJ12641 fis, clone NT


5.68 


416653 


AA768553 


Hs.77496 


metallothionein 1E (functional)


5.67 


452679 


Z42387 


Hs.4299 


transmembrane, prostate androgen induced 


5.66 


450244 


AA007534 


Hs.125062 


ESTs 


5.66 




AI970672 


Hs.46638 


chromosome 11 open reading frame 8 


5.65 




AI935962 


Hs.26289 


ESTs 


5.65 




AW162840


Hs.6641 


kinesin family member 5C 


5.64 


452387 


AI680772 


Hs.4316 


trinucleotide repeat containing 12 


5.64 


413992 


W26276 


Hs.136075 


RNA, U2 small nuclear 


5.63 


444151 


AW972917 


Hs.128749 


alpha-methylacyl-CoA racemase 


5.63 




AW965339 


Hs.111471


ESTs 


5.62 




AI936442 


Hs.59838 


hypothetical protein FLJ10808


5.60 




D60925 




ESTs 


5.60 




NM_001394


Hs.2359 


dual specificity phosphatase 4 


5.60 


434415 


BE177494




gb:RC6-HT0596-270300-011-C05 HT0596 Homo 


5.60 


440738 


AI004650 


Hs.225674 


WD repeat domain 9 


5.60 




AM 42095 


Hs.143273 


ESTs 


5.60 


449603 




Hs.197698 




5.60 






Hs.75912 


KIAA0257 protein 


5.59 




NM_016010


Hs.118821


CGI-62 protein 


5.56 


435047 


AA454985 


Hs.54973 


cadherin-like protein VR20 


5.55 
5.55 


400268 


AI377431 


Hs.293772 


hypothetical protein MGC10858


5.54 




AI885608 


Hs.94122 


ESTs 


5.54 






Hs.283059 


Homo sapiens PRO1082 mRNA, complete cds 


5.53 


418339 


AA639902 


Hs.104215 


ESTs, Moderately similar to SPCO_HUMAN S


5.52 




BE387162 


Hs.280858 


ESTs, Highly similar to A35661 DNA excis 


5.52 


439569 


AW602166 


Hs.222399 


CEGP1 protein 


5.51 




AA973905 


Hs.16003 


intermediate filament protein syncoilin 


5.50 


448310 


AI480316 




gb:tm26h09.x1 Soares_NFL_T_GBC_S1 Homo s


5.50 


413173 


BE076928 


Hs.70980 


ESTs 


5.48 




AW450963 


Hs.119991


ESTs 


5.48 




AI656959 


Hs.222165 


ESTs 


5.48 




AB012124 


Hs.30696 


transcription factor-like 5 (basic helix 


5.48 




AA885569 




Homo sapiens cDNA FLJ14511 fis, clone NT


5.46 




AI675944 


Hs.188691 


Homo sapiens cDNA FLJ12033 fis, clone HE


5.44


429126 


AW172356


Hs.99083


ESTs 


5.44 


431316 


AA502663 


Hs.145037 


ESTs 


5.44 


439192 


AW970536 


Hs.105413 


ESTs 


5.44 


431938 


AA938471 


Hs.115242


specific granule protein (28 kDa); cyste 


5.44 


451552 


AA047233 


Hs.33810 


ESTs 


5.43 


416991 


N36389 


Hs.295091 


KIAA0226 gene product


5.42 


427638 


AA406411 


Hs.208341 


ESTs, Weakly similar to KIAA0989 protein 


5.42 


427718 


AI798680


Hs.25933 


ESTs 


5.42 


438710 


AA833907 


Hs.178724 


ESTs, Weakly similar to ALU1_HUMAN ALU S 


5.42 


406076 


AL390179 


Hs.137011 


Homo sapiens mRNA; cDNA DKFZp547P134 (fr


5.40 


431263 


AW129203 


Hs.13743 


ESTs 


5.40 


421264 


AL039123 


Hs.103042 


microtubule-associated protein 1B


5.38 


421685 


AF189723


Hs.105778 


ATPase, Ca++ transporting, type 2C, memb 


5.37 






408460 


AA054726 


Hs.285574 


ESTs 


5.36 




AW970386 


Hs.269423 










Hs.286131 


CGI-101 protein 




428002 


AA418703 




gb:zv98c03.s1 Soares_NhHMPu_S1 Homo sapi 




441217 


AI922183


Hs.213246 


ESTs 


5.36 












422806 


BE314767 


Hs.1581 


glutathione S-transferase theta 2 




432281 


AK001239 


Hs.274263 


hypothetical protein FLJ10377




451982 


F13036 


Hs.27373 


Homo sapiens mRNA; cDNA DKFZp56401763 (f 




421129 


BE439899 


Hs.89271 


ESTs 




444042 


NM_004915 


Hs.10237 


ATP-binding cassette, sub-family G (WHIT 






AW382942 










AW877787 


Hs.136102


KIAA0853 protein 






XB5689 


Hs.288617 


hypothetical protein FLJ22621




447752 


M73700 


Hs.347 


lactotransferrin 


5.29 




R53790 


Hs.23294 


hypothetical protein FLJ14393




431359 


AW993522 


Hs.292934 








AW293849 


Hs.58279 


ESTs, Weakly similar to ALU7_HUMAN ALU S 








Hs.299221 


pyruvate dehydrogenase kinase, isoenzyme 






AW016670 


Hs.233275 








AA758751 


Hs.98216 






428845 


AL157579 


Hs.153610


KIAA0751 gene product


5 ~OA 








bromodomain and PHD finger containing, 3 




419986 


AI345455 


Hs.78915 


GA-binding protein transcription factor, 


5.24 




AA312551 


Hs.230157 






420111 


AA255652 




gb:zs21h11.r1 NCI_CGAP_GCB1 Homo sapiens 




428058 


AI821625 


Hs.191602 


ESTs 


5.22 








gb:1)'70e07.x1 Soares_NSF_F8_9W_OT_PA_P_S 




432524 


AI458020 


Hs.293287 






436207 


AA334774 


Hs.12845 


hypothetical protein MGC13159 


5.22 


410870 


U81599 


Hs.66731 


homeo box B13 




451418 


BE387790 


Hs.26369 


hypothetical protein FLJ20287 




409757 


NM_001898


Hs.123114 


cystatin SN 




441124


T97717 


Hs.119563


ESTs 




428593 


AW207440 


Hs.185973 


degenerative spermatocyte (homolog Droso 




436401 


AI087958


Hs.29088 


ESTs 


5.20 


437113 


AA744693 




gb:ny26c10.s1 NCI_CGAP_GCB1 Homo sapiens 




450947 


AI745400 


Hs.204662 






453279 


AW893940 


Hs.59698 






445467 


AI239832 


Hs.15617 


ESTs, Weakly similar to ALU4_HUMAN ALU S 




448944 


AB014605 


Hs.22599 


atrophin-1 interacting protein 1; activi 




412198 


AA937111 


Hs.69165 






422646 




Hs.151380 


ESTs, Weakly similar to T16584 hypotheti 




438986 


AF085888 


Hs.269307 






453954 


AW118338


Hs.75251 


DEAD/H (Asp-Glu-Ala-Asp/His) box binding 


5 "i« 




AK000288 


Hs.18800 


hypothetical protein FLJ20281




434029 


AA621763 


Hs.170434 


Homo sapiens cDNA FLJ14242 fis, clone OV




459294 


AW977286 


Hs.169531 


RBP1-like protein


5.16 






Hs.204096


lipophilin B (uteroglobin family member) 




424692 


AA429834 


Hs.151791 


KIAA0092 gene product 


5.15 




AW020782 


Hs.79881 


Homo sapiens cDNA: FLJ23006 fis, clone L




419872 


AI422951 


Hs.146162 


ESTs 


5.15 


429422 


AK001494 


Hs.202596 


Homo sapiens cDNA FLJ10632 fis, clone NT




448902 


Z45998 


Hs.22543


Homo sapiens mRNA; cDNA DKFZp761 11912 (f 








Hs.30567 


ESTs, Weakly similar to B34087 hypotheti 




431318 


AA502700


Hs.293147 


ESTs, Moderately similar to A46010 X-lin


5.14 






Hs.271741






428372 


AK000684 


Hs.183887 


hypothetical protein FLJ22104


5.12 


434401 


AI864131 


Hs.71119 


Putative prostate cancer tumor suppresso 


5.12 


416434 


AW163045


Hs.79334 


nuclear factor, interleukin 3 regulated 


5.11 


410268 


AA316181 


Hs.61635 


six transmembrane epithelial antigen of 


5.10 


417517 


AF001176 


Hs.32238 


POP4 (processing of precursor, S. cerev


5.10 


453516 


NM_003462 


Hs.33846 


dynein, axonemal, light intermediate pol 


5.10 


427958 


AM18000 


Hs.98280 


potassium intermediate/small conductance 


5.09 


407945 


X69208 


Hs.606 


ATPase, Cu++ transporting, alpha polypep 


5.08 


425154 


NM_001851


Hs.154850 


collagen, type IX, alpha 1 


5.08 


412863 


AA121673 


Hs.59757 


zinc finger protein 281 


5.06 


420807 


AA280627 


Hs.57846 


ESTs 


5.06 


430568 


AA769221 


Hs.270847 


delta-tubulin 


5.06 








AA743991 




gb:ny57g01.s1 NCI_CGAP_Pr18 Homo sapiens 


5.06 


438375 


AW015940 


Hs.232234 


ESTs 


5.06 






Hs.106604 








AW968159 


Hs.289104 


Alu-binding protein with zinc finger dom 


5.05 






Hs.75295 




5.04 




AK000275 


Hs.75871 


5.04 


432729 


AK000292 


Hs.278732 


hypothetical protein FLJ20285


5.04 




AI692623 


Hs.121513


Homo sapiens clone Z"3-1 placenta expres 


5.04 


439662 




Hs.269060 




5.04 




AL389956 


Hs.283858 


Homo sapiens mRNA full length insert cDN 


5.04 




AL049176 


Hs.82223 


chordin-like


5.02 




AI088192 


Hs.135474 


ESTs, Weakly similar to DDX9_HUMAN ATP-D






AF093419 


Hs.169378 




5.02 


429782 


NM_005754


Hs.220689








AI167145 


Hs.165538


ESTs


5.02 




AW850417 


Hs.254020 


ESTs, Moderately similar to unnamed prot 


5.02 


438571 


AW020775 


Hs.56022 


ESTs


5.02 




AA418204 


Hs.241493 


natural killer-tumor recognition sequenc 


5.02 






Hs.267705 




5.01 


417730 


Z44761 525 




gb:HSC28F061 normalized infant brain cDN 


5.00 








protein kinase C, iota


5.00 




NM_004855


Hs.247118 




5.00 




AW978813 


Hs.250867 


metallothionein 1E (functional)


5.00 




AB029006 


Hs.26334 


spastic paraplegia 4 (autosomal dominant 


5.00 


420380 


AA640891 


Hs.102406


ESTs 


4.99 






Hs.239625 


ESTs, Weakly similar to alternatively sp


4.99


442653 


BE269247 


Hs.170226 


gb:601185486F1 NIH_MGC_8 Homo sapiens cD


4.98 




AW972565 


Hs.32399


ESTs, Weakly similar to S51797 vasodilat 






NM_001490


Hs.159642 


glucosaminyl (N-acetyl) transferase 1, c


4.97 


446279 


AA490770 


Hs.182382




4.96 




AI752713 


Hs.43845 










Hs.168640






412715 


NM_000947




primase, polypeptide 2A (58kD) 


4.94 




R61680 


Hs.26904 


ESTs, Moderately similar to Z195_HUMAN Z


4.94 




AW968271 


Hs.191534


ESTs, Weakly similar to ALU1_HUMAN ALU S






N87820


Hs.106826


KIAA1696 protein 


4.93 




AV655234 
BE244053 


Hs.298083 
Hs.79362 


ESTs, Moderately similar to PC4259 ferri 
retinoblastoma-like 2 (p130) 


4.92 
4.92 




AA211467 
AW023323 


Hs.190488
Hs.121070 


Homo sapiens, Similar to nuclear localiz 


4.92 
4.92 






Hs.293185 


ESTs, Weakly similar to JC7328 amino aci


4.91 




AF163474 


Hs.96744 




4.91 










4.91 




AA764852 


Hs.291567 




4.90 




AF003522 


Hs.250500 










Hs.179825






428804 


AK000713 


Hs.193736 


hypothetical protein FLJ20706 


4.88 






Hs.163440


Homo sapiens cDNA: FLJ21000 fis, clone C




418575 


AA225313 


Hs.222886 


ESTs, Weakly similar to TRHY_HUMAN TRICH


4.86




AA557191 


Hs.55028 


ESTs, Weakly similar to I54374 gene NF2 


4.86 




AI801777 






4.86




AI202703 


Hs.152414 




4.86 




NM_005872


Hs.22960 




4.86 




AJ245587 


Hs.25275 




4.85 






Hs.100489
















456516 


BE172704 


Hs.222746 


KIAA1610 protein 


4.84 


426501 


AW043782 


Hs.293616 


ESTs 


4.84 


448730 
458339 


AB032983 
AW976853 


Hs.21894 
Hs.172843


KIAA1157 protein
ESTs 


4.84 
4.83 


422083 


NM_001141 


Hs.111256 


arachidonate 15-lipoxygenase, second typ 


4.82


420159 


AI572490 


Hs.99785 


Homo sapiens cDNA: FLJ21245 fis, clone C


4.82 


424103 


NM_001918 


Hs.139410 


dihydrolipoamide branched chain transacy 


4.82 


449535 


W15267 


Hs.23672 


low density lipoprotein receptor-related 


4.82 


422048 


NM_012445


Hs.268126 


spondin 2, extracellular matrix protein 


4.82 


416737 


AF154335 


Hs.79691 


LIM domain protein 


4.82 


419972 


AL041465 


Hs.294038 


golgin-67 


4.81 


420235 


AA256756 


Hs.31178 


ESTs 


4.81 


423412 


AF109300 


Hs.147924 


prostate cancer associated protein 5 


4.80 
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M426565 
AW188944 
AI694972 
AA715026



457114 AI821625



410765 
421040 
421518 



450642 
419879 
411440 
450649 
408738 
435020 
411624 



Z17805 

AW749402 

NM_001429

NM_014785 

AW505076 

BE145964 

AA448488 



Hs.269710 
Hs.191602 
Hs.289109 
Hs.301129 
Hs.160541
Hs.61272 
Hs.66180 
Hs.135280
Hs.208819 



Hs.196245
Hs.161712 
Hs.7130 



453800 BE300741 

457528 AW973791 

416795 AI497778 

407302 R74206 
404721 



452741 BE392914 



419706 
412088 
416276 
429281 

448207
408374 
447162 
451900 
421437 

418624
426172 



403764 
410659 
432383 



437812 
433447 
434715 
447673



C04649 
AI689496 
U41060 



AW821252 
AI734080 
AA371307 
AW136488 
AW962597 
AI217477
AA789081 

AI080175
AK000144 
AW189232



Hs.297722 
Hs.47313 
Hs.301855 

Hs.55346 
Hs.24944 
Hs.177537 
Hs.21398 
Hs.288416 
Hs.292784 
Hs.168053 
Hs.268755 

Hs.168670
Hs.272203 
Hs.13872 
Hs.37932 
Hs.30323 
Hs.78851 
Hs.10067 
Hs.115831 
Hs.11556
Hs.30503 
Hs.132967 
Hs.77899 
Hs.108932 
Hs.79136 
Hs.29808 
Hs.170577 
Hs.155591 
Hs.157396 
Hs.57207
Hs.104336 
Hs.104211 



Hs.25545 
Hs.31305 
Hs.194591
Hs.4029 



Hs.169911 
Hs.16846
Hs.167593
Hs.116410
Hs.182285



ESTs 
ESTs 

histone deacetylase 3

hypothetical protein FLJ11193

ESTs, Moderately similar to ALU1_HUMAN A 

ESTs 

nucleosome assembly protein 1-like 2

ESTs 

ESTs 

ESTs 

gb:EST376063 MAGE resequences, MAGH Homo 

ESTs 

ESTs 

copine IV 

Homer, neuronal immediate early gene, 2 

gb:QV4-BT0383-281299-061-c06 BT0383 Homo 

E1A binding protein p300 

KIAA0258 gene product 

DiGeorge syndrome critical region gene 8 

KIAA0594 protein 

ribosomal protein L44 

ESTs, Weakly similar to 21 09260A B cell 

hypothetical protein DKFZp761B1514 

ESTs, Moderately similar to A Chain A, H 

hypothetical protein FLJ13340



peroxisomal farnesylated protein 

Homo sapiens cDNA FLJ20843 fis, clone AD

cytokine-like protein C17

ESTs 

ESTs, Weakly similar to B34087 hypotheti
KIAA0217 protein 

ESTs, Weakly similar to A43932 mucin 2 p 
ESTs 

Homo sapiens cDNA FLJ12566 fis, clone NT
Homo sapiens cDNA FLJ11344 fis, clone PL
Human EST clone 122887 mariner transposo 
tropomyosin 1 (alpha) 
ESTs 

LIV-1 protein, estrogen regulated 

Homo sapiens cDNA: FLJ21122 fis, clone C

ESTs 

forkhead box F1

ESTs, Weakly similar to A46010 X-linked 

KIAA0982 protein 

hypothetical protein 

ESTs 

ESTs 

ESTs 

KIAA1547 protein
ESTs 

glioma-amplified sequence-41 
ESTs 

Homo sapiens cDNA FLJ20137 fis, clone CO
cutaneous T-cell lymphoma tumor antigen 
KIAA1495 protein
ESTs 

ESTs, Weakly similar to 04HUD1 debrisoqu 
hypothetical protein FLJ13409; KIAA1711
ESTs 
ESTs 



BE391727 Hs.102910 



ESTs 

general transcription factor IIH, polype 
KIAA1265 protein
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433644 


AW342028 


Hs.256112 


gb:hb75d03.x1 NCI_CGAP_Ut2 Homo sapiens 


408321 


AW405882 


Hs.44205 


cortistatin 




AA192669


Hs.45032




440348 


AW015802


Hs.47023 


ESTs 


446351 


AW444551 


Hs.258532 


x001 protein 


451212 


AW902672 


Hs.287334 


ESTs 


430294 


AI538226


Hs.135184 


guanine nucleotide binding protein 4 


435005 


U80743 


Hs.4316 


trinucleotide repeat containing 12 


448072 


AI459306 


Hs.24908 


ESTs 


403721 








451018 


AW965599 


Hs.247324 


mitochondrial ribosomal protein S14 


453070 


AK001465 


Hs.31575 


SEC63, endoplasmic reticulum translocon 


417412 


X16896 


Hs.82112 


interleukin 1 receptor, type I


439735 


AI635386 


Hs.142846 
Hs.134273 


hypothetical protein 
ESTs 


435663 
424036 


AI023707 
AA770688 


Hs.81946 


H2A histone family, member L 


426386 


AA748850 


Hs.174877 


bladder cancer overexpressed protein 


408622 


AA056060 


Hs.202577 


Homo sapiens cDNA FLJ12166 fis, clone MA


444269 


AI590346 


Hs.146220 


ESTs 


430187 


AI799909 


Hs.158939 


ESTs 


427761 


AA412205 


Hs.140996 


ESTs 


430261 


AA305127 


Hs.237225 


hypothetical protein HT023 


444169 


AV648170 


Hs.58756 


ESTs 


430598 


AK001764 


Hs.247112 


hypothetical protein FLJ10902


412903 


BE007957 


Hs.155795 


ESTs 


417048 


AI088775 


Hs.55498 


geranylgeranyl diphosphate synthase 1


442710 


AI015631 


Hs.23210 


ESTs 


457413 


AA743462 


Hs.165337


ESTs 


400303 


AA242758 


Hs.79136 


LIV-1 protein, estrogen regulated 


443268 


AI800271 


Hs.129445 


hypothetical protein FLJ12496


438209 


AL120659 


Hs.6111 


aryl-hydrocarbon receptor nuclear transi 


431724 


AA514535 


Hs.283704 


ESTs 


412280 


AW205116 


Hs.272814 


hypothetical protein DKFZp434E1723 


440801 


AA906366 


Hs.190535 


ESTs 


452959 


AI933416 


Hs.189674 


ESTs 


453861 


AI026838 


Hs.30120 


ESTs, Weakly similar to NUCL_HUMAN NUCLE


417421 


AL138201 


Hs.82120 


nuclear receptor subfamily 4, group A, m 


447270 


AC002551 


Hs.331 


general transcription factor IIIC, polyp 


433641 


AF080229 




gb:Human endogenous retrovirus K clone 1 


447078 


AW885727 


Hs.301570 


ESTs 


424242 


AA337476 




hypothetical protein MGC13102 


408170 


AW204516 


Hs.31835 


ESTs 


448757 


AI366784 


Hs.48820 


TATA box binding protein (TBP)-associate


420021 


AA252848 


Hs.293557 


ESTs 


449694 


AI659790 


Hs.253302


ESTs 


453867 


AI929383 


Hs.108196 


hypothetical protein DKFZp434N185 


458712 


AI347502 


Hs.173066 


hypothetical protein FLJ20761


417251 


AW015242 


Hs.99488 


ESTs, Weakly similar to YK54_YEAST HYPOT 


434423 


NM_006769


Hs.3844


LIM domain only 4 


423427 


AL137612 


Hs.285848 


KIAA1454 protein 


415715 


F30364 




ESTs 


404561 








422969 


M782536 


Hs.122647 


N-myristoyltransferase 2


423685 


BE350494 


Hs.49753 


uveal autoantigen with coiled coil domai


443977 


AL120986 


Hs.150627 


ESTs, Weakly similar to I38022 hypotheti 


425071 


NM_013989 


Hs.154424 


deiodinase, iodothyronine, type II 


431583 


AL042613 


Hs.262476 




411379 


AI816344 


Hs.12554 


ESTs, Weakly similar to NPL4_HUMAN NUCLE


421476 


AW953805 


Hs.21887 


ESTs 


425178 


H16097 


Hs.161027 


ESTs 


439262 


AA832333 


Hs.124399 


ESTs 


442818 


AK001741 


Hs.8739 


hypothetical protein FLJ10879


421977 


W94197 


Hs.110165 


ribosomal protein L26 homolog 


437114 


AA836641 


Hs.163085 


ESTs 


420195 


N44348 


Hs.300794 


Homo sapiens cDNA FLJ11177 fis, clone PL


418330 


BE409405 


Hs.94722 


ESTs 


419750 


AL079741 


Hs.183114 


Homo sapiens cDNA FLJ14236 fis, clone NT



gb:HC3-HT0585-16030(M)22-b09 HT0585 Homo 

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AA1 79233 














Hs.132243






















AA732524 


Hs.151464 


ESTs, Weakly similar to ALUC_HUMAN !!!!


4ri 






AI273419 


Hs.135146










AA282330 


Hs.145668








448330 


AL036449


Hs.207163










NM_005923


Hs.151988 


mitogen-activated protein kinase kinase 


4.21




432621 


AI298501 


Hs.12807 


ESTs, Weakly similar to T46428 hypotheti 






AI248720 


Hs.114390










AA662913 


Hs.190173 


ESTs, Weakly similar to A46010 X-linked






424085 


NM_002914


Hs.139226




4.20




440749 


W22335 




hypothetical protein MGC3199








W93048 


Hs.227203 


hypothetical protein MGC2747




443414 




Hs.25209 




4.20






AA256769


Hs.94949 








444170 


AW613879 


Hs.102408










AA766998 




















447476 


BE293466 


Hs.20880 


ESTs, Weakly similar to I38022 hypotheti 


4.19




448543 


AW897741 


Hs.21380 


Homo sapiens mRNA; cDNA DKFZp586P1124 (f


4.18




410294 


AB014515 


Hs.288891 


KIAA0615 gene product 


4.18 






AA602004 
















RBPl'.rke 6 1 ' y Sm ' - 




447194 


AW976438 


Hs.17426




4.18






AW959731 


Hs.32538 








439328 


W07411


Hs.118212


ESTs, Moderately similar to ALU3_HUMAN A










Hs.299842 










AI283085


Hs.290931 


ESTs, Weakly similar to YFJ7_YEAST HYPOT






AK001160












AI089292


Hs.287621 


hypothetical protein FLJ14069 








Z40239 92 










419103 




Hs.96423


hypothetical protein FLJ23033


4.14




4054UJ 






ESTs, Moderately similar to I38022 hypot






















Hs.94834




















438578 


AA811244 


Hs.164163 


ESTs Sap ' ens 


4.14




AI697193 












AW366286 


Hs.145696 


splicing factor (CC1.3)






448148 


NM_016578




HBV pX associated protein-8 


4.13










s^ne/ftrMnhe'ta'na^^ 6 ^ 






417531 


NM_003157 


Hs.1087




4.12








hypothetical protein MGC10895








AA548518 












AA836126 










434739 


AA804487 


Hs.144130 


ESTs 


4.12






AW205969


Hs.131803 






425810 


AI923627 


Hs.31903 


ESTs 


4.10






AW973775 


Hs.130760














hypothetical protein FLJ13117 






4W1? 


AB016247 


Hs.288031 




4.09






AL162039


Hs.31422 


Homo sapiens mRNA; cDNA DKFZp434M229 (fr






AA099802 












AA775483 


Hs.288936 








443204 


AW205878 


Hs.29643 


Homo sapiens cDNA FLJ13103 fis, clone NT


4.08 




445459 


AI478629


Hs.158465 


likely ortholog of mouse putative IKK re


4.08 




438938


H46212 


Hs.137221 


ESTs 


4.07 


454119 


BE549773 


Hs.40510 


uncoupling protein 4 


4.06 




411000 


N40449 


Hs.201619 


ESTs, Weakly similar to S38383 SEB4B pro 


4.06 




418926 


AA232658 


Hs.87070 


UDP-glucose:glycoprotein glucosyltransfe 


4.06 




424432 


AB037821 


Hs.146858


protocadherin 10 


4.06 




449673 


AA002064 


Hs.18920 


ESTs 


4.06 


429299 


AI620463 


Hs.99197 


hypothetical protein MGC13102 


4.03 




422174 


AL049325 


Hs.112493


Homo sapiens mRNA; cDNA DKFZp564D036 (fr 


4.05 




455497 


AA112573


Hs.285691 


Homo sapiens prostein mRNA, complete cds 


4.05 




415138 


C18356 


Hs.78045 


tissue factor pathway inhibitor 2


4.04 




402791 








4.04 



Hs.172329












Homo sapiens, clone IMAGE:301G666, mRNA,










ESTs, Weakly similar to ALU8_HUMAN ALU S 






AF155655 


Hs.18835




And 




AI739168 


Hs.131798


Homo sapiens cDNA FLJ13458 fis, clone PL






AI572438 


Hs.32976 


guanine nucleotide binding protein 4 




453857 








402 


428249 


AA130914 


Hs.183291 


zinc finger protein 268 


4.02 


432015 


AL157504 


Hs.159115 


Homo sapiens mRNA; cDNA DKFZp58600724 (f 


4.02 




BE622641 


Hs.38489


ESTs, Weakly similar to I38022 hypotheti






M86178 












2334P0 




402 




AA284040 




ESTs, Highly similar to CA5B_HUMAN CARBO




1«>MQ 


AA203285 


Hs.294141


ESTs, Weakly similar to alternatively sp 


In? 




AW068658 


Hs.20943 






4C37204 




Hs.140237


ESTs, Weakly similar to ALU1_HUMAN ALU S






AVWI27<K 






401 




AA706319 










AK00Q213 




hypothetical protein






AA247152 




ESTs, Weakly similar to KIAA1074 protein 






AA216419 


Hs.269295 


gb:nc16e03.s1 NCI_CGAP_Pr1 Homo sapiens 






AW291128


Hs.278422 


DKFZP586G1122 protein








Hs.100293


O-linked N-acetylglucosamine (GlcNAc) tr 






AW015644 


Hs.301430 


TEA domain family member 1 (SV40 transcr 


dm 




AL049266 


Hs.18724


Homo sapiens mRNA; cDNA DKFZp564FQ93 (fr 






AL133761 




gb:DKFZp761C1413_r1 761 (synonym: hamy2) 






AA631910 


Hs.162849


cf ? ■ „„„, L 










ESTs, Weakly similar to I38022 hypotheti


OOQ 




AW836261 


Hs.177486
















Iiqroq 


AW015206 


Hs.178784








AB020695 


Hs.91662 


KIAA0888 protein 


IQfi 




AF150262


Hs.162898


















AA504428 




Homo sapiens, clone IMAGE:3954132, mRNA, 






NM_000380 


Hs.192803


xeroderma pigmentosum, complementation g 






AA314998 


Hs.241503 


hypothetical protein 






NM_003877


Hs.110776


STAT induced STAT inhibitor-2


3Q5 














AW894Q17 




gb:RC4-NN0027-150400-012-g04 NN0027 Homo








Hs.201194 


EST f 






AA909249 


Hs.112282


solute carrier family 30 (zinc transport
















AA761073




TRAF family member-associated NFKB activ






AW23S021 


Hs.108788


Homo sapiens, Similar to RIKEN cDNA 5730




449869 




Hs.60059 


Homo sapiens cDNA FLJ11478 fis, clone HE






AW182924 


Hs.128790








AI400752 












Hs.187931








AW235094 


Hs.193424


zinc finger protein 




424882 


AI379461 


Hs.153636


far upstream element (FUSE) binding prot 








Hs.168950


Homo sapiens mRNA; cDNA DKFZp566A1046 (F 






AA397679 


Hs.298460 






440404 


AI015881 


Hs.125616


mitochondrial ribosomal protein S5






AW501435 


Hs.171409


v-akt murine thymoma viral oncogene homo 




453058 


AW612293 


Hs.288684 


Homo sapiens cDNA FLJ11750 fis, clone HE


3.92 


423583 


AL12205S 


Hs.129836


KIAA1028 protein


3.92 


408001 


AA046458 


Hs.95296 


ESTs 


3.92 


419197 


N48921 


Hs.27441 


KIAA1615 protein 


3.91 


428695 


AI355647 


Hs.189999


purinergic receptor (family A group 5) 


3.91 


401747 








3.91 


410011 


AB020641 


Hs.57856


PFTAIRE protein kinase 1 


3.91 


432205 


AI806583 


Hs.125291 


ESTs 


3.91 


447857 


AA081218 


Hs.58603


Homo sapiens cDNA FLJ14206 fis, clone NT


3.91 


446494 


AA463276 


Hs.288906 


WW Domain-Containing Gene 


3.91 


409928 


AL137163


Hs.57549 


hypothetical protein dJ473B4 


3.90 


411598 


BE336654 


Hs.70937 


H3 histone family, member A 


3.90 


424790 


All 19344 


Hs.13326 


ESTs, Weakly similar to 2004399A chromos 


3.90
E74-like factor 5 (ets domain transcript


3.90 


431325 


AW026751 


Hs5794 3 


ESTs, Weakly similar to 2109260A B cell


3.89 




NM_003729




RNA 3'-terminal phosphate cyclase 


3.89 


401045 












AW864793 


Hs.34161


thrombospondin 1 


3^69 




BE378541 


Hs.279815 


cysteine sulfonic acid decarboxylase-ret 








Hs.283631 


hypothetical protein DKFZp547G183 




417067 




Hs.81086 


solute carrier family 22 (extraneuronal 


3.88 




AF102546


Hs.63931 






422660 


AW297582 


Hs.237062 


hypothetical protein FLJ22548 similar to


3.88 




AB035301 


Hs.272211 




3.88 




AW023798 
AA613792 


Hs.286025 


gb:no97h03.s1 NCI_CGAP_Pr2 Homo sapiens 


3.88 
3.88 










3.88


431088 


AA491824 


Hs.196881 




3.88 




AL120173 


Hs.301663 


ESTs 


3.87


422089 


AA523172 


Hs.103135


ESTs, Weakly similar to SFR4_HUMAN SPLIC 


3.87 




AL049013


Hs.28783 


KIAA1223 protein 


3.87 




AA805166 


Hs.165165 


HIV-1 rev binding protein 2 


3.86 




AI929602 




phosphatidylinositol glycan, class H 


3.86 










3.86 






Hs.288390 


hypothetical protein FLJ22795 


3.85 






Hs.191343






433332 


AI367347 


Hs.127809


Homo sapiens clone TCCCTA00151 mRNA sequ 


3.65 




AA456454 


Hs.118637


cell division cycle 2-like 1 (PITSLRE pr 


3.85 




AL133900






3.84 




AA130158


Hs.19977


ESTs, Moderately similar to ALU8_HUMAN A


3.84 




AL037421 


Hs.208746 


ESTs, Moderately similar to pot. ORF I [ 


3.84 




AW247430 
AW503398 


Hs.84152 
Hs.210047 


ESTs, Moderately similar to I38022 hypot 




434804 


AA649530 




gb:ns44f05.s1 NCI_CGAP_AM Homo sapiens


3.83 


















Homo sapiens, clone IMAGE:4285740, mRNA 


3.82 




AA151057


Hs.153498


chromosome 18 open reading frame 1 






BE246138 


Hs.30853 




3.82 




T78300 


Hs.171409










Hs.179729 




3.82 


427756 


AI376540 




ESTs


3.82 


444701 


AI916512 


Hs.198394 




3.82 






Hs.123072






429259 


AA420450 


Hs.292911 


ESTs, Highly similar to S60712 band-6-pr 


3.82 




AA033813 


Hs.79018 


chromatin assembly factor 1, subunit A (


3.82 








gb:yd78d06.s1 Soares fetal liver spleen 
lunton' neml)er ° nC0Sene fam " y '"' <e 1 


3.81 


410P97 


AA148710 


Hs\ 59441 




3.81 




AW117322










W87707 


Hs.82065 




3.80 


419423 


D25488 


Hs.90315 


KIAA0007 protein 


3.80 






Hs.187548










Hs.258561 






445060 


AA830811 


Hs.88803 


ESTs 


3.80 


449419 


R34910 


Hs.119172


ESTs 


3.80 




AA040403 


Hs.60371 




3.80 




AL040683 


Hs.167031


DKFZP566D133 protein 


3.79 




AL044056 


Hs.158047




3.79 












444078 


BE246919 


Hs.10290 


U5 snRNP-specific 40 kDa protein (hPrp8- 


3.78 


417318 


AW953937 


Hs.12891 


ESTs 


3.78 


414664 


AA587775 


Hs.66295 


multi-PDZ-domain-containing protein


3.78 


410275 


U85658 


Hs.61796 


transcription factor AP-2 gamma (activat


3.77 


410503 


AW975746 


Hs.188662 


KIAA1702 protein 


3.77 


434170 


AA626509 


Hs.122329 


ESTs 


3.77 


421838 
425268 
431696 


AW881089 
AI807883 
AA259068 


Hs.108806
Hs.156932 
Hs.267819 


Homo sapiens mRNA; cDNA DKFZp566M0947 (f 
Homo sapiens cDNA FLJ20653 fis, clone KA
protein phosphatase 1, regulatory (inhib 


3.77 
3.76 
3.76 


411990 


AW963624 


Hs.31707 


ESTs, Weakly similar to YEW4_YEAST HYPOT


3.76 


430291 


AV660345 


Hs.238126 


CGI-49 protein 


3.76 


448779 


BE042877 


Hs.177135 


ESTs 


3.76 


452682 


AA456193 


Hs.155606 


progesterone membrane binding protein 


3.75
AI831594




ESTs, Weakly similar to ALU7_HUMAN ALU S 


3.75 




AA908731




CLLL8 protein 


3.75 


440258 


AI741633 


Hs.125350 


3.74 




AL121087 


Hs.298406 


KIAA0685 gene product 








Hs88550 5 


ESTs, Weakly similar to JC5238 galactosy 




420653 


AI224532 






3.74 




AI879330 


Hs.265960


hypothetical protein FLJ10563


3.74 






Hs.156971 


hypothetical protein DKFZp434G1415 


3.74 










3.74 


419440 


AB020689 


Hs.90419 


KIAA0882 protein 






BE546208 


Hs.26090 


hypothetical protein FLJ20272


3.73 




NM_003816










AI732722 


Hs.187694


ERGL protein; ERGIC-53-like protein




438394 


BE379623 


Hs.27693 








AV657117


Hs.184164 


ESTs, Moderately similar to S65657 alpha




449433 


AI672096 


Hs.9012 


ESTs, Weakly similar to S26650 DNA-bindi
ESTs


3.72 




BE503227


Hs.134759 


3.72 




BE614599 


Hs.106823 


hypothetical protein MGC14797 










Friedreich ataxia region gene X123 


3.72 






Hs.165372 


ESTs, Moderately similar to ALU1_HUMAN A 


3.71 




AI224483 








449897 


AW819642 


Hs.24135 


transmembrane protein vezatin; hypotheti


3.71 




AI628272 




ESTs, Weakly similar to ALU1_HUMAN ALU S








Hs.194606 










Hs.199429 


Homo sapiens mRNA; cDNA DKFZp434M2216 (f






AA768098 


Hs.189079








AF030933 










BE502601 


Hs.134289 


ESTs, Weakly similar to KIAA1063 protein 






BE242873 










AI659306 


Hs.73826 








AI378329 


Hs.126629 








AL120697


Hs.110640 






419994 


AA282881 


Hs.190057


ESTs 


3.69 


412628 


AI972402 


Hs.173902 


hypothetical protein MGC2648 


3.69




AA532718 


Hs.178604 


ESTs 


3.69 


439444 


AI277652




ESTs, Weakly similar to I38022 hypotheti 






AA704703 


Hs.77031


E^e^vm8 Pt ' 0n ' aCt0r 




447397 


BE247676 






368 














AU076696 




CDC5 (cell division cycle 5, S. pombe, h 




442242 


AV647908 


Hs.90424


Homo sapiens cDNA: FLJ23285 fis, clone H


368 




BE538356 










AI318624 


Hs.107444 










Hs.174007 








AI720078 


Hs.291997 


ESTs, Weakly similar to A47582 B-cell gr 








Hs.44898 








AW880562 


Hs.114574 






427356 


AW023482 


Hs.97849 


ESTs 


3.66 






Hs.31092 




3.66 






Hs.89584 








AI064824 


Hs.193385












vasoa e in es na pep e re p 




447500 


AI381900 


Hs.159212 




365 




AI696671


Hs.294110 








AI382555 


Hs.127950 






419346 


AI830417 




polybromo 1 


3.64


441540 


C01367 


Hs.127128 


ESTs 


3.64 


446501 


AI302616 


Hs.150819


ESTs 


3.64 


459527 


AW977556 


Hs.291735 


ESTs, Weakly similar to I78885 serine/th


3.63 


446320 


AF126245 


Hs.14791 


acyl-Coenzyme A dehydrogenase family, me 


3.63 


435706 


W31254 


Hs.7045 


GL004 protein 


3.63 


400110 








3.62 


410313 


R10305 


Hs.185683 


ESTs 


3.62 


414713 


BE465243 


Hs.12664 


ESTs 


3.62 


436279 


AW900372 


Hs.180793 


ESTs, Weakly similar to S65657 alpha-1C-


3.62 


439818 


AL360137 


Hs.19934 


Homo sapiens mRNA full length insert cDN


3.62 


451797 


AW663858 


Hs.56120 


small inducible cytokine subfamily E, me 


3.62 


451294 


AI457338 


Hs.29894


ESTs 


3.62
Hs.283940


Homo sapiens PRO1550 mRNA, partial cds






AW968504 


Hs.123073


CDC2-related protein kinase 7 


































AW3737B4 






























AA504249 


Hs.187585








BE540255 










Z42047 22 








434228 




Hs.283978 


Homo sapiens PRO2751 mRNA, complete cds


360 




AA731491 




hypothetical protein MGC14879 






AW005505 










H46008 


He Q1R1 Q 






404210 








359 




BE270828 


Hs.131740 


Homo sapiens cDNA: FLJ22562 fis, clone H








Hs.122421


Human DNA sequence from clone RP1-187J11




4M147 


AA987927 


Hs.131740


Homo sapiens cDNA: FLJ22562 fis, clone H






AA024898 


Hs.296002 












matrix metalloproteinase 26










hypothetical protein FLJ10980 








Hs.124344












histone deacetylase 7B






AA340605 


Hs.105887


ESTs, Weakly similar to Homo log of rat 2 


\ w 




AW975076 


Hs.172589








AL034344 


Hs.298020 


forkhead box C1 








Hs.85092 


thyroid hormone receptor interactor 11




428949 


AA442153 


Hs.104744


hypothetical protein DKFZp434J0617 






AI685841


Hs.161354






433339 


AF019226 


Hs.8036 


glioblastoma overexpressed 


3.54


424369 


R87622 


Hs.26714 


KIAA1831 protein 


3.54 




AF048730 












Hs.31416




4«W1 


AI648602 


Hs.131189












RALBP1 associated Eps domain containing














AW499536 




gb:UI-HF-BR0p-aji-c-12-0-UI.r1 NIH_MGC_5






AA363025 


Hs.155572


Human clone 23801 mRNA sequence 






AW236550 


Hs.131914 






438^88 


AA806349 






^2 




AV646177 










AL360159 












Hs.257359


















AI821926


Hs.269507 


gb:nt78f05.x5 NCI_CGAP_Pr3 Homo sapiens 






AL135173 










AB037817 


HS.23Q188 


KIAA1396 protein 






H99999 










AA224077 




Sm protein F 














N23018 


Hs.171391


C-terminal binding protein 2 






BE379646 




Homo sapiens mRNA full length insert cDN 




447311 


R37010 


Hs.33417


Homo sapiens cDNA: FLJ22806 fis, clone K


3.50


447805 


AW627932 


Hs.19614


gemin4 


3.50 


454265 


H03556 


Hs.300949 


ESTs, Weakly similar to thyroid hormone 


3.50 


418838 


AW385224 


Hs.35198 


ectonucleotide pyrophosphatase/phosphodi


3.50 


448804 


AW512213 


Hs.42500 


ADP-ribosylation factor-like 5 


3.50 


409S17 


BE003760 


Hs.55209 


Homo sapiens mRNA; cDNA DKFZp434K0514 (f


3.49 


434075 


AW003416 


Hs.160604 


ESTs 


3.49 


444190 


AI878918 


Hs.10526 


cysteine and glycine-rich protein 2 


3.49 


435017 


AA336522 


Hs.12854 


angiotensin II, type 1 receptor-associat


3.48 


423445 


NM_014324


Hs.128749 


alpha-methylacyl-CoA racemase


3.48 


420271 


AI954365 


Hs.42892 


ESTs 


3.48 


443S84 


AI631307 


Hs.165674 


ESTs 


3.48 


444168 


AW379879 




gb:RC1-HT0256-081199-011-f01 HT0256 Homo


3.48 


446074 


AA079799 


Hs.29263 


hypothetical protein FLJ11896


3.48
AL137407




Homo sapiens mRNA; cDNA DKFZp434M232 (fr 










Fq? W kl 






AW975050 










AW975339 


Hs.239828 


ESTs, Weakly similar to GAG2_HUMAN RETRO 






AA509200 




gb:af12e02.s1 Soares_testis_NHT Homo sap 










carboxypeptidase E






AF035269 




phosphatidylserine-specific phospholipas






AW796524 




Homo sapiens microsomal signal peptidase 






AA354940 


Hs.145958 






442007 


AA301116 


Hs.142838 


nucleolar phosphoprotein Nopp34






AW007066 


Hs.18949 


ESTs, Weakly similar to CA2B_HUMAN COLLA 


3.45


426931 


NM_003416




zinc finger protein 7 (KOX 4, clone HF.1 


3.45


403739 


W01556 


Hs.238797 


ESTs, Moderately similar to I38022 hypot


3.45 


436024 


AI800041 


Hs.190555 


ESTs 


3.45 


408418 


AW963897 


Hs.44743 


KIAA1435 protein 


3.45 


409151 


AA306105 


Hs.50785 


SEC22, vesicle trafficking protein (S. c 


3.44 


418626 


AW299508 


Hs.135230 


ESTs 


3.44 


420560 


AW207748 


Hs.59115


ESTs 


3.44 


420686 


AI950339 


Hs.40782 


ESTs 


3.44 


428870 


AA436831 


Hs.36049 


ESTs 


3.44 


436754 


AI061283 


Hs.133437 


ESTs 


3.44 


437960 


AI669585 


Hs.222194 


ESTs 


3.44 


452300 


AW628045 


Hs.28896 


Homo sapiens mRNA full length insert cDN 


3.44 


421887 


AW161450 


Hs.109201 


CGI-86 protein


3.44

TABLE 5A shows the accession numbers for those primekeys lacking a UnigeneID in Tables
5, 6, and 7. For each probeset, we have listed the gene cluster number from which the
oligonucleotides were designed. Gene clusters were compiled using sequences derived from
Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity
using Clustering and Alignment Tools (DoubleTwist, Oakland, California). The Genbank
accession numbers for the sequences comprising each cluster are listed in the "Accession"
column.
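
As an illustration of how the "Accession" column can be read, the following minimal sketch splits one whitespace-separated cell into individual accession numbers and flags any token that does not look like a Genbank nucleotide accession. It assumes the accession shapes in common use when these tables were compiled (one letter followed by five digits, or two letters followed by six digits). The function names are hypothetical and are not part of the described method.

```python
import re

# Assumption: accession numbers have one letter + five digits (e.g. R86913)
# or two letters + six digits (e.g. AW848047).
ACCESSION_RE = re.compile(r"(?:[A-Z]\d{5}|[A-Z]{2}\d{6})")


def split_accessions(cell: str) -> list[str]:
    """Split one 'Accession' cell into its whitespace-separated tokens."""
    return cell.split()


def flag_malformed(cell: str) -> list[str]:
    """Return the tokens in a cell that do not match the assumed accession shape."""
    return [tok for tok in split_accessions(cell) if not ACCESSION_RE.fullmatch(tok)]


if __name__ == "__main__":
    cell = "AW848009 AW648067 AWB48069 AW848905 AW848214"
    print(flag_malformed(cell))
```

A token flagged in this way is only a candidate for review; the check cannot tell which character was misread.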



409770 
411440 
411479 

411624 
412991 
414269 
415123 
415715 
416288 
416289 
417730 
418636 
419346 
419536 
420111 
422219 
424179 



437113
444168 
448212 



115301J 
1154048J 
124577J 
1247077J 



R86913 R83901 H25352 R01370 H43764 AW044451 W21298



AW963990 AA078196 AW749482 AA077468 BE151571 AA376917
AW499536 AW499553 AW502138 AW499537 AW502136 AW501743
AW749402 AW749403 Z45743 R80376 AA093358

AW848047 AW848202 AW848631 AW848142 AW848702 AW848121 AW848632 AW848140 AW848571 

AW848009 AW648067 AW848069 AW848905 AW848214

BE145964 BE146286 AW854564

AW949013 AA126111

AA298489 AA137165

D60925 D60828 D80787

F30364 F36559 T15435

H51299 H44619 H46391 R86024 H51892 T72744
W26333 R05358 H44682
Z44761 R25801 R11926 R35604
AW749855 AA225995 AW750208 AW750206
AI830417 AA236612



AA255652 AA280911 AW967920 AA262684

AW978073 AW978072 AA807550 AA306567 

F30712 F35665 AW263888 AI904014 AI904018 AA336927 AA336502 

AA337476 AW966227 AA450376 AW960222 AA381051

AA418703 AA418711 BE071915 BE071920 BE071912

AA884766 AW974271 AA592975 AA447312 

AA527941 AI810608 AI620190 AA635266 



AA650114 AW974148 AA572946 
T85301 AW517087 AA601054 BE073959

AF080229 AF080231 AF080230 AF080232 AF080233 AF080234 BE550633 AI636743 AW614951 BE467547
AI680833 AI633818 N29986 U87592 U87593 U87590 U87591 S46404 U87587 AA463992 AW206802 AI970376
AI583718 AI672574 N25695 AW665466 AI818326 AA126128 AI480345 AW013827 AA248638 AI214988
AA204735 AA207155 AA206262 AA204833 AW003247 AW496808 AI080480 AI631703 AI651023 AI867418
AW818140 AA502500 AI206199 AI671282 AI352545 BE501030 AI652535 BE465762 AA206331 AW451868
AA471088 AA206342 AA204834 AA206100 AW021661 AA332922 N66048 AA703396 H92278 AW139734
H92683 U87589 U87595 H69001 U87594 BE466420 AI624817 BE466611 AI206344 AA574397 AA348354 
AI493192 

AA743991 AA604852 AW272737
AA513792 AW182329 T05304 AW858385
BE177494 AW276909 AA632849
T52172 AF147324 T52248
AA649530 AA659316 H54973
AA744693 AW750059



AI475858 AW969013
AI480316 AW847535 
M86178 AI813822 D56993
BE077084 AW139963 AW863127 AW806209 AW806204 AW806205 AW806206 AW806211 AW806212



AW838616 AW838560 BE144343 AI914520 AW888910 BE184854 BE184784
AL133761 AL133767 

BE176479 BE176678 BE176357 BE176550 AW886079 BE176676 BE176615 BE176655 BE176489 BE176610

TABLE 5B shows the genomic positioning for those primekeys lacking Unigene IDs and
accession numbers in Tables 5, 6, and 7. For each predicted exon, we have listed the
genomic sequence source used for prediction. Nucleotide locations of each predicted exon
are also listed.



Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.
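
The nucleotide position entries in the table that follows are comma-separated "start-end" ranges, one range per predicted exon. As a minimal sketch of how such an entry might be read, the code below parses one entry into coordinate pairs and computes exon lengths. It assumes that both coordinates are inclusive, which the table does not state; the function names are hypothetical.

```python
def parse_exon_positions(field: str) -> list[tuple[int, int]]:
    """Parse an entry such as '37491-37670,40951-41031' into (start, end) pairs."""
    exons = []
    # Drop stray spaces first, since a long entry may have been wrapped or padded.
    for part in field.replace(" ", "").split(","):
        if not part:
            continue
        start_text, end_text = part.split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"start exceeds end in range {part!r}")
        exons.append((start, end))
    return exons


def exon_lengths(exons: list[tuple[int, int]]) -> list[int]:
    """Length of each exon, assuming inclusive start and end coordinates."""
    return [end - start + 1 for start, end in exons]


if __name__ == "__main__":
    exons = parse_exon_positions("37491-37670,40951-41031")
    print(exons, exon_lengths(exons))
```

The strand column is not used here; for a "Minus" entry the same ranges would be read on the reverse strand of the listed genomic sequence.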











401045 


8117619 


Plus 


90044-90184,91111-91345


401424 


8176894 


Plus 


24223-24428 


401451 


6634068 


Minus 


119926-121272 


401714 


6715702 


Plus 


96484-96681 


401747 


9789572 


Minus 


118596-118816,119119-119244,119609-119761,120422-120990,130161-130381,130468-130593,131097-131258,131866-131932,132451-132575,133580-134011


401785 


7249190 


Minus 


165776-165996,166189-166314,166408-166569,167112-167268,167387-167469,168634-168942 


401819 


7467933 


Minus 


28217-28486 


402408 


9796239 


Minus 


110326-110491


402444 


9796614 


Plus 


28391-28517 


402791 


6137008 


Minus 


51036-51207 


403047 


3540153 


Minus 


59793-59968 


403137 


9211494 


Minus 


92349-92572,92958-93084,93579-93712,93949-94072,94591-94748,95214-95337 


403721 


7528046 


Minus 


156647-157366 


403764 


7717105 


Minus 


118692-118853 


403797 


8099896 


Minus 


123065-125008 


404165 


9926489 


Minus 


69025-69128 


404210 


5006246 


Plus 
Minus 


169926-170121 
55675-56055 


404253 
404561 


9367202 
9795980 


Minus 


69039-70100 


404571 


7249169 


Minus 


112450-112648 


404721 


9856648 


Minus 


173763-174294 


404915 


7341766 


Minus 


100915-101087 


404939 


6862697 


Plus 


175318-175476 


405403 


6850244 


Minus 


37491-37670,40951-41031 


405685 


4503129 


Minus 


37956-38097 


405718 


9795467 


Plus 


113080-113266 


405793 


1405887 


Minus 


89197-89453 


405876 


6758747 


Plus 


39694-40031 


405917 


7712162 


Minus 


106829-107213 


406414 


9256407 


Plus 


49593-49850 


406554 


7711566 


Plus 


106956-107121

TABLE 6: 286 GENES ENCODING EXTRACELLULAR OR CELL SURFACE
PROTEINS UP-REGULATED IN PROSTATE CANCER COMPARED TO
NORMAL ADULT TISSUES

Table 6 shows 286 genes up-regulated in prostate cancer compared to normal adult tissues
that are likely to encode extracellular or cell-surface proteins. These were selected as for
Table 5, with the additional requirement that the predicted protein contain a structural
domain indicative of extracellular localization (e.g., egf or 7tm domains).



Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of tumor to normal tissue
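
To illustrate how the five columns fit together, the following minimal sketch groups a flat list of table cells into rows and ranks the rows by the tumor-to-normal ratio. The class and function names are hypothetical, and the cutoff passed to the ranking function is an arbitrary value chosen for the example, not a threshold taken from this document. The three sample rows are copied from Table 6.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Table6Row:
    pkey: int            # Pkey: Eos probeset identifier
    ex_accn: str         # ExAccn: exemplar Genbank accession number
    unigene_id: str      # UnigeneID: Unigene number
    unigene_title: str   # Unigene Title (truncated as printed)
    r1: float            # R1: ratio of tumor to normal tissue


def rows_from_cells(cells: list[str]) -> list[Table6Row]:
    """Group a flat list of cells, five per row, into Table6Row records."""
    if len(cells) % 5 != 0:
        raise ValueError("expected a multiple of five cells")
    rows = []
    for i in range(0, len(cells), 5):
        pkey, ex_accn, unigene_id, title, r1 = cells[i:i + 5]
        rows.append(Table6Row(int(pkey), ex_accn, unigene_id, title, float(r1)))
    return rows


def at_least(rows: list[Table6Row], min_ratio: float) -> list[Table6Row]:
    """Rows whose ratio meets an illustrative cutoff, highest ratio first."""
    kept = [row for row in rows if row.r1 >= min_ratio]
    return sorted(kept, key=lambda row: row.r1, reverse=True)


CELLS = [
    "400298", "AA032279", "Hs.61635", "six transmembrane epithelial antigen of", "43.48",
    "420154", "AI093155", "Hs.95420", "JM27 protein", "41.12",
    "409731", "AA125985", "Hs.56145", "thymosin, beta, identified in neuroblast", "45.24",
]

if __name__ == "__main__":
    for row in at_least(rows_from_cells(CELLS), min_ratio=42.0):
        print(row.pkey, row.ex_accn, row.r1)
```

Rows with an empty ExAccn or UnigeneID cell, which occur in the table for probesets designed from predicted exons, would need separate handling before grouping.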




Pkey 


ExAccn 


UnigeneID


Unigene Title


R1 


409361 


NM_005982


Hs.54416 


sine oculis homeobox (Drosophila) homolo 


48.28


409731 


AA125985


Hs.56145 


thymosin, beta, identified in neuroblast 


45.24 


400298 


AA032279 


Hs.61635 


six transmembrane epithelial antigen of 


43.48 


420154 


AI093155 


Hs.95420 


JM27 protein 


41.12 


426747 


AA535210 


Hs.171995 


kallikrein 3, (prostate specific antigen


31.80 


400299 


X07730 


Hs.171995 


kallikrein 3, (prostate specific antigen 


24.91 


425075 


AA506324 


Hs.1852 


acid phosphatase, prostate 


24.53


424846 


AU077324 


Hs.1832 


neuropeptide Y 


23.57 


405635 








20.90 


420757 


X78592 


Hs.99915 


androgen receptor (dihydrotestosterone r 


19.72 


418994 


AA296520 


Hs.89546 


selectin E (endothelial adhesion molecul 


19.56 


452792 


AB037765 


Hs.30652 


KIAA1344 protein 


17.39 


445472 


AB006631 


Hs.12784 


Homo sapiens mRNA for KIAA0293 gene, par 


17.00 


414565 


AA502972 


Hs.183390 


hypothetical protein FLJ13590


16.82 


431716 


D89053 


Hs.268012 


fatty-acid-Coenzyme A ligase, long-chain 


16.60 


408430 


S79876 


Hs.44926 


dipeptidylpeptidase IV (CD26, adenosine 


16.28


408000 


L11690 


Hs.620 


bullous pemphigoid antigen 1 (230/240 kD) 


15.54 


430226 


BE245562 


Hs.2551 


adrenergic, beta-2-, receptor, surface 
hypothetical protein FLJ11264
calmegin 


15.40 


444484 


AK002126 


Hs.11260 


14.76 




AA279490 


Hs.86368 




448999 


AF179274 


Hs.22791 


transmembrane protein with EGF-like and 


14.55 


416182 


NM_004354


Hs.79069 


cyclin G2 


12.94 


420544 


AA677577 


Hs.98732 


Homo sapiens Chromosome 16 BAC clone CIT 


12.79 


445413 


AA151342 


Hs.12677


CGI-147 protein 


12.64 


453930 


AA419466 


Hs.36727 


hypothetical protein FLJ10903 


12.22


440286 


U29589 


Hs.7138 


cholinergic receptor, muscarinic 3 


12.04 


452784 


BE463857 


Hs.151258 


hypothetical protein FLJ21062


11.86 


450203 


AF097994 


Hs.301528 


L-kynurenine/alpha-aminoadipate aminotra


11.68 


448045 


AJ297436 


Hs.20166 


prostate stem cell antigen 


11.51


449650 


AF055575 


Hs.23838 


calcium channel, voltage-dependent, L ty 


11.18 


420381 


D50640 


Hs.337616 


phosphodiesterase 3B, cGMP-inhibited 


11.10 


425665 


AK001050 


Hs.159066 


hypothetical protein FLJ10188


11.08 


425710 


AF030880 


Hs.159275 


solute carrier family, member 4 


11.08 


428728 


NM_016625


Hs.191381 


hypothetical protein 


11.04 


407021 


U52077 




gb:Human mariner1 transposase gene, comp


11.02 


410733 


D84284 


Hs.66052 


CD38 antigen (p45) 


11.02 


452340 


NM_002202


Hs.505 


ISL1 transcription factor, LIM/homeodoma 


10.85 


428819 


AL135623


Hs.193914


KIAA0575 gene product 
KIAA0990 protein 


10.48 


421991 


NM_014918


Hs.110488


10.04 


431217 


NM_013427


Hs.250830 


Rho GTPase activating protein 6 


9.75 


421470 


R27496 


Hs.1378


annexin A3


9.64 


409262 


AK000631 


Hs.52256 


hypothetical protein FLJ20624


9.45 


435980 


AF274571 


Hs.129142 


deoxyribonuclease II beta 


9.24 


421246 


AW582962 


Hs.102897


CGI-47 protein 


9.20


410001 


AB041036 


Hs.57771 


kallikrein 11 


9.03 


441791 


AW372449 


Hs.175982 


hypothetical protein FLJ21159


9.02 






404571 










456497 


AW967956 


Hs.123648 


ESTs, Weakly similar to AF108460 1 ubinu 




419968 


X04430 


Hs.93913 


interleukin 6 (interferon, beta 2)




433172 


AB037841 


Hs.102652 


hypothetical protein ASH1 




422631 


BE218919 


Hs.118793


hypothetical protein FU1068B 


jj™ 


427674 


NM_003528


Hs.2178 


H2B histone family, member Q 




404915 










452259 


AA317439 


Hs.28707 


signal sequence receptor, gamma (translo 




452891 


N75582 


Hs.212875 


ESTs, Weakly similar to DYH9_HUMAN CILIA 




439731 


AI953135 


Hs.45140 


hypothetical protein FLJ14084


703 


419839 


U24577 


Hs.93304 


phospholipase A2, group VII (platelet-ac 




420120 


AL049610 


Hs.95243 


transcription elongation factor A (SII)-




424099 


AF071202 


Hs.139336 


ATP-binding cassette, sub-family C (CFTR 




448706 


AW291095 


Hs.21814 


interleukin 20 receptor, alpha 


7.52 


410227 


AB009284 


Hs.61152 


exostoses (multiple)-like 2 




425211 


M18667 


Hs.1867 


progastricsin (pepsinogen C) 




441736 


AW292779 


Hs.169799 






419991 


AJ000098


Hs.94210 


eyes absent (Drosophila) homolog 1 




425018 


BE245277 


Hs.154196 


E4F transcription factor 1 




424560 


AA158727 


Hs.150555 


protein predicted by clone 23733 




409110 


M191493 


Hs.48778 


niban protein 




421566 


NM_000399 


Hs.1395 


early growth response 2 (Krox-20 (Drosop 


7.04


431725 


X65724 


Hs.2839 


Norrie disease (pseudoglioma) 




425782 


U66468 


Hs.159525 


cell growth regulatory with EF-hand doma 




427408 


AA5B3206 


Hs.2156 


RAR-related orphan receptor A 




435604 


AA625279 


Hs.26892 


uncharacterized bone marrow protein BM04 


fiS 


415874 


AF091622 




KIAA0244 protein 




401451 










431778 


AL080276 


Hs.268562 


regulator of G-protein signalling 17 


6.51 


409089 


NM_014781 


Hs.50421 


KIAA0203 gene product 




431992 


NM_002742 


Hs.2891 


protein kinase C, mu 




404253 








6.42 


421552 


AF026692 


Hs.105700 


secreted frizzled-related protein 4 




416806 


NM_000288 


Hs.79993 


peroxisomal biogenesis factor 7 




431958 


X63629


Hs.2877 


cadherin 3, type 1, P-cadherin (placenta 




439366 


AF100143 


Hs.6540 


fibroblast growth factor 13 




416836 


D54745 


Hs.80247 


cholecystokinin 


6.30 


433383 


AF034837 


Hs.192731 


double-stranded RNA specific adenosine d 




450728 


AW162923 


Hs.25363 


presenilin 2 (Alzheimer disease 4) 


6.25


413384 


NM_000401 


Hs.75334 


exostoses (multiple) 2 




423349 


AF010258 


Hs.127428 


homeobox A9




424800 


AL035588 


Hs.153203 


MyoD family inhibitor 




425451 


AF242769 


Hs.157461 


mesenchymal stem cell protein DSC54 


6.14 


447359 


NM_012093 


Hs.18268 


adenylate kinase 5 




410889 


X91662 


Hs.66744 


twist (Drosophila) homolog (acrocephalos




408829 


NM_006042


Hs.48384 


heparan sulfate (glucosamine) 3-O-sulfot 




453911 


AW503857 


Hs.4007 


sarcolemmal-associated protein




408875 


NM_015434 
X82125 


Hs.48604 


DKFZP434B168 protein




450480 


Hs.25040 


zinc finger protein 239 




451684 


AF216751 


Hs.26813 


CDA14 


400301 


X03635 


Hs.1657 


estrogen receptor 1 




415077 


L41607 


Hs.934 


glucosaminyl (N-acetyl) transferase 2, I


5.74


418852 


BE537037 


Hs.273294 


hypothetical protein FLJ20069


5.72 


446867 


AB007891 


Hs.16349 


KIAA0431 protein 




410232 


AW372451 


Hs.61184 


CGI-79 protein 




422762 


AL031320 


Hs.119976 


Human DNA sequence from clone RP1-20N2 o 






AL133067


Hs.302689 




5.70 


408621 


AI970672 


Hs.46638


chromosome 11 open reading frame 8 


5.65 


439671 


AW162840 


Hs.6641 


kinesin family member 5C


5.64 


410196 


AI936442 


Hs.59838 


hypothetical protein FLJ10808


5.60 


429170 


NM_001394


Hs.2359 


dual specificity phosphatase 4 


5.60 


440733 


AI004650


Hs.225674 


WD repeat domain 9


5.60 


414342 


M742181 


Hs.75912 


KIAA0257 protein


5.59 


422634 


NM_016010


Hs.118821


CGI-62 protein 


5.56 
5.55 


400268 
439569 


AW602166 


Hs.222399 


CEGP1 protein 


5.51 


452823 


AB012124 


Hs.30696 


transcription factor-like 5 (basic helix 


5.48 


431938 


AA938471 


Hs.54431 


specific granule protein (28 kDa); cyste 


5.44 


427638 


AA406411 


Hs.208341 


ESTs, Weakly similar to KIAA0989 protein 


5.42
AL039123


Hs.103042


microtubule-associated protein 1B




421685 


AF189723


Hs.106778


ATPase, Ca++ transporting, type 2C, memb




421987 


AI133161 


Hs.286131 


CGI-101 protein 




422806 


BE314767 


Hs.1581 


glutathione S-transferase theta 2 




432281 


AK001239 


Hs.274263 


hypothetical protein FU1 10377 




451982 


F13036 


Hs.27373 


Homo sapiens mRNA; cDNA DKFZp56401763 (f 


5.32 


444042 


NM_004915


Hs.10237 


ATP-binding cassette, sub-family G (WHIT 




447752 


M73700 


Hs.105938


lactotransferrin 




451418 


BE387790 




hypothetical protein FLJ20287


Km 


428593 


AW207440 


Hs.185973


degenerative spermatocyte (homolog Droso 




447541 


AK000288 


Hs.18800 


hypothetical protein FLJ20281 




459294 


AW977286 


Hs.17428 


RBPI-like protein 


5.16 


424692 


AA429834 


Hs.151791 


KIAA0092 gene product 




416434 


AW163045 


Hs.79334 


nuclear factor, interleukin 3 regulated 




410268 


AA316181 


Hs.61635 


six transmembrane epithelial antigen of 




417517 


AF001176 


Hs.82238 


POP4 (processing of precursor, S. cerev


■! in 


453616 


NM_003462


Hs.33846 


dynein, axonemal, light intermediate pol 




427958 


AA418000 


Hs.98280 


potassium intermediate/small conductance 




407945 


X69208 


Hs.606 


ATPase, Cu++ transporting, alpha polypep 




418576 


AW968159 


Hs.289104 


Alu-binding protein with zinc finger dom 


5.05 


413328 


Y15723 


Hs.75295 


guanylate cyclase 1, soluble, alpha 3 


5.04 


432729 


AK000292 


Hs.278732 


hypothetical protein FLJ20285




426342 


AF093419 


Hs.169378 


multiple PDZ domain protein 




429782 


NM_005754


Hs.220689 


Ras-GTPase-activating protein SH3-domain 




436209 


AW850417 


Hs.254020 


ESTs, Moderately similar to unnamed prot 


5.02 


430599 


NM_004855 


Hs.247118 


phosphatidylinositol glycan, class B 




451386 


AB029006


Hs.26334 


spastic paraplegia 4 (autosomal dominant 




457211 


AW972565 


Hs.32399 


ESTs, Weakly similar to S51797 vasodilat 


4.97 


425851 


NM_001490 


Hs.159642 


glucosaminyl (N-acetyl) transferase 1, c




421689 


N87820 


Hs.106826 


KIAA1 696 protein 




416533 


BE244053 


Hs.79362 


retinoblastoma-like 2 (p130) 




432653 


N62096 


Hs.293185 


ESTs, Weakly similar to JC7328 amino aci 




403047 
431117 


AF003522 


Hs.25O500 


delta (Drosophila)-like 1 




427617 


D42063 


Hs.199179 


RAN binding protein 2 




428804 


AK000713 


Hs.1 93736 


hypothetical protein FLJ20706




449071 


NM_005872 


Hs.22960 


breast carcinoma amplified sequence 2 


1'fli 


407596 


R86913 




gb:yq30f05.r1 Soares fetal liver spleen




456516 


BE172704 


Hs.222746 


KIAA1 610 protein 




458339 


AW976853 


Hs.172843 


ESTs 




422083 


NM_001141


Hs.111258 


arachidonate 15-lipoxygenase, second typ 




449535 


W15267 


Hs.23672 


low density lipoprotein receptor-related 




422048 


NMJ12445 


Hs.268126 


spondin 2, extracellular matrix protein 




424602 


AK002055 


Hs.151046 


hypothetical protein FU11 193 




410765 


AI694972 


Hs.66180 


nucleosome assembly protein 1-like 2




419879 


Z17805 


Hs.93564 


Homer, neuronal immediate early gene, 2 




450649 


NM_001429


Hs.25272 


E1A binding protein p300 




411624 


BE145964 


Hs.103283 


KIAA0594 protein 




404721 










426261 


AW242243 


Hs.1 68670 


peroxisomal farnesylated protein




416276 


U41060 


Hs.79136 


LIV-1 protein, estrogen regulated 




403374 


AW025430 


Hs.155591 


forkhead box F1


4.64 


451900 


AB023199 


Hs.27207 


KIAA0982 protein 




421437 


AW821252 


Hs.104336 


hypothetical protein 


4.63 


434629 


AA789081 


Hs.4029 


glioma-amplified sequence-41 




403764 








458 




BE391727 


Hs 102910 


general transcription factor IIH polype 


4.53 


403721 








4.50 


453070 


AK001465 


Hs.31575 


SEC63, endoplasmic reticulum translocon 


4.49 


417412 


X16896 


Hs.82112 


interleukin 1 receptor, type I 


4.48 


439735 


AI635386 


Hs.142846 


hypothetical protein 


4.48 


430261 


AA305127 


Hs.237225 


hypothetical protein HT023 


4.46 


430598 


AK001764 


Hs.247112 


hypothetical protein FLJ10902 


4.44 


400303 


AA242758 


Hs.79136 


LIV-1 protein, estrogen regulated 


4.42 




AL120659 


Hs.6111 


aryl-hydrocarbon receptor nuclear transl 


4.42 


417421 


AL138201 


Hs.82120 


nuclear receptor subfamily 4, group A, m 


4.40 


447270 


AC002551 


Hs.331 


general transcription factor IIIC, polyp 


4.38 


434423 


NM_006769 


Hs.3844 


LIM domain only 4 


4.35 


404561 








4.32 
AA782536


Hs.122647 


425071 


BE350494 
NM_013989 


Hs.49753 
Hs.154424 




AL042613 


Hs.262476 




AK001741 




423740 


Y07701 


Hs.293007 




NM_005923 


Hs.151988 




NM_002914 


Hs.1 39226 




AB014515 


Hs.323712 


447124 


AW976438 


Hs.17428 


438018 


AK001 160 
AI089292 


Hs.5999 
Hs.2B7621 




AF1 69692 




405403 








NM_016578 


Hs.20509 


417531 


NM 003157 




433345 


AI681545 


Hs!l52982 


432712 


AB016247 


Hs.288031 


435114 


AA775483 


Hs.288936 


445459 


AI478629 


Hs.158465 








438660 


U95740 


Hs.6349 


447568 


AF1 55655 


Hs.18885 


452211 


AI985513 


Hs.233420 


443292 


AK000213 


Hs.9196 


420911 


U77413 


Hs.100293 


428738 


NM_000380 


Hs.192803 


430456 


AA314998 


Hs.241503 


437531 


AI400752 


Hs.1 12259 
Hs.189999 


428695 
410011 


AI355647 
AB020641 


Hs.57856 


446494 


AA463276 


Hs.288906 


409928 


AL137163 


Hs.57549 


411598 


BE336654 


Hs.70937 


425707 


AF1 15402 




451806 


NMJJ03729 


Hs.27076 


401045 




Hs.283631 


437372 


AA323968 
AJ001417 


Hs.81086 


410467 


AF1 02546 


Hs.63931 


431930 


AB035301 


Hs.272211 


453047 


AW023798 


Hs.286025 


401785 






458229 


A1929602 




406414 






412494 






418329 


AW247430 




424850 


M151057 


Hs.153498 


427585 




Hs.179729 


423052 


M28214 


Hs.123072 


416111 


AA033813 


Hs.79018 


419423 


D25488 


Hs.90315 


429643 


AA455889 


Hs.167279 


431499 


NM_001514 


Hs.258561 


444078 


BE246919 


Hs.10290 




AV660345 


Hs.238126 


431637 


AI879330 


Hs.265960 


440411 


N30256 


Hs.151093 


405917 






451230 


BE546208 


Hs.26090 


429597 


NM 003816 


Hs.2442 


415075 


L27479 


Hs.77889 


440351 


AF030933 


Hs.7179 


443603 


BE502601 


Hs.134289 


446965 


BE242873 


Hs.16677 


412350 


AI659306 


Hs.73826 


433852 


AI378329 


Hs.126629 


447397 


BE247676 


Hs.18442 



uveal autoantigen with coiled coil domai
deiodinase, iodothyronine, type II
S-adenosylmethionine decarboxylase 1
hypothetical protein FLJ10879
aminopeptidase puromycin sensitive
mitogen-activated protein kinase kinase
replication factor C (activator 1) 2 (40
KIAA0615 gene product
RBP1-like protein
hypothetical protein FLJ10298
hypothetical protein FLJ14069
protocadherin 9

HBV pX associated protein-8
serine/threonine kinase 2
hypothetical protein FLJ13117
sterol-C5-desaturase (fungal ERG3, delta
mitochondrial ribosomal protein L9
likely ortholog of mouse putative IKK re

Homo sapiens, clone IMAGE:3010666, mRNA, 

CGI-1 16 protein 

ESTs 

hypothetical protein 

O-linked N-acetylglucosamine (GlcNAc) tr 
xeroderma pigmentosum, complementation g 
hypothetical protein 
T cell receptor gamma locus 
purinergic receptor (family A group 5) 
PFTAIRE protein kinase 1 
WW Domain-Containing Gene 
hypothetical protein dJ473B4 
H3 histone family, member A 
E74-like factor 5 (ets domain transcript 
RNA3'-terminalph ' 



hypothetical protein DKFZp547G183 
solute carrier family 22 (extraneuronal 
dachshund (Drosophila) homolog 
cadherin 7, type 2



phosphatidylinositol glycan, class H

ADP-ribosylation factor domain protein 1 
cystathionine-beta-synthase 
chromosome 18 open reading frame 1 
collagen, type X, alpha 1 (Schmid metaph 
RAB3B, member RAS oncogene family 
chromatin assembly factor 1 , subunit A ( 
KIAA0007 protein 

FYVE-finger-containing Rab5 effector pro
general transcription factor IIB
U5 snRNP-specific 40 kDa protein (hPrp8-
CGI-49 protein

hypothetical protein FLJ10563
hypothetical protein DKFZp434G141S

hypothetical protein FLJ20272

a disintegrin and metalloproteinase doma 

Friedreich ataxia region gene X123 

RAD1 (S.pombe) homolog 

ESTs, Weakly similar to KIAA1 063 protein 

WD repeat domain 15 
Hs.155174


CDC5 (cell division cycle 5, S. pombe, h 


3.68 






Hs.107444 


Homo sapiens cDNA FLJ20562 fis, clone KA


3.67 








von Hippel-Lindau syndrome


3.67 
3.66 






Hs.31092 




3.66 


419078 


M93119 


Hs.89584 


insulinoma-associated 1 


3.66 










3.65 




AI382555 


Hs.127950 




3.65 


446320 


AF1 26245 


Hs!l4791 


acyl-Coenzyme A dehydrogenase family, me 


3.63 


404939 








3.62 










3.60 




AV/005505 




y p gp 




404210 








359 




AF291664 


Hs.204732 






452501 


AB037791 


Hs.29716 


hypothetical protein FLJ10980 


3.56 




NM_014707 


Hs.1 16753 


histone deacetylase 7B 


3.55 


420230 


AL034344 


Hs.284186 


forkhead box C1 


3.55 








thyroid hormone receptor interactor 1 1 


3.54 


433002 


AF048730 


Hs.279906 




3.53 










3.52 


457940 


AL360159 


Hs.306517 


Homo sapiens TRIpartite motif protein ps 


3.52 
3.52 


402444 
418250 


U29926 


Hs.83918 


adenosine monophosphate deaminase (isofo 


3.si 


414222 


AL13S173 


Hs.878 


sorbitol dehydrogenase 


3.51 


422384 


AA224077 


Hs.42438 


Sm protein F 


3.50 


447805 


AW627932 


Hs.19614 


gemlrvt 


3.50 


454265 


H03556 


Hs.300949 


ESTs, Weakly similar to thyroid hormone 


3.50 


423445 


NM_014324 


Hs.128749 


alpha-methylacyl-CoA racemase 


3.48 


413435 


X51405 


Hs.75360 


carboxypeptidase E 


3.46 


447210 


AF035269 


Hs.17752 


phosphatidylserine-spedficphospholipas 


3.46 


426931 


NM 003416 


Hs.2076 


zinc finger protein 7 (KOX 4, clone HF.1 


3.45 


403418 


AW963897 


Hs.44743 


KIAA1435 protein 


3.45 


421887 


AW161450 


Hs.1 09201 


CGI-86 protein 


3.44 
Table 7: 42 GENES ENCODING SMALL MOLECULE TARGETS UP-REGULATED IN
PROSTATE CANCER COMPARED TO NORMAL ADULT TISSUES

Table 7 shows 42 genes up-regulated in prostate cancer compared to normal adult tissues that
are likely to be small molecule targets. These were selected as for Table 5, with the additional
requirement that the predicted protein contain a structural domain indicative of a druggable
structure (e.g., protease, kinase, phosphatase, receptor). The functional domain is indicated for each gene.

Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

PSDomain: Protein Structural Domain

R1: Ratio of tumor vs. normal tissue



Pkey 


ExAccn 


426747 
400299 


AA535210 
X07730 


420757 


X78592 








U29589 


407021 


D5Q640 


401424 




410001 


AB041036 


428330 
424099 


L22524 
AF071202 


419991 


AJ000098 


431992 


NMJM2742 


447359 


NM 012093 


400301 
421685 


X03635 
AF189723 


444042 


NMJXJ4915 


447752 


M73700 


407945 


X69208 


403047 




427617 


D42063 


422083 


NM.001141 


449535 


W15267 


425071 


NM_013989 


423740 


Y07701 


424701 


NM 005923 


424085 


NM 002914 


417531 


NM 003157 


428695 


AI355647 


410011 


AB020641 


424850 


AA151057 
AI659306 


412350 
447397 


BE247676 


452946 
427144 


X95425 
X95097 


443775 


AF291664 


457940 


AL360159 


418250 


U29926 


413435 


X51405 


447210 


AF035269 



Hs.171995 kallikrein 3, (prostate specific antigen 

Hs.171995 kallikrein 3, (prostate specific antigen 

Hs.99915 androgen receptor (dihydrotestosterone r 

Hs.44926 dipeptidylpeptidase IV (CD26, adenosine 

Hs.2551 adrenergic, beta-2-, receptor, surface 

Hs.68583 mitochondrial intermediate peptidase 

Hs.7138 cholinergic receptor, muscarinic 3 

Hs.337616 phosphodiesterase 3B, cGMP-inhibited 



PSDomain 

trypsin 
trypsin 

Androgen_recep,hormone_rec,zf-C4 

DPPIV_N_term,Peptidase_S9 

7tm_1 



trarsposase gene, comp SET.TransposaseJ 



19.72 
16.28
15.40 
14.81 
12.04 
11.10 
11.02 



in 11 



Hs.106778 
Hs.10237 
Hs.105938 
Hs.606 

Hs.199179 
Hs.1 11256 
Hs.23672 
Hs.154424 
Hs-293007 
Hs.151988 



matrix metalloproteinase 7 (matrilysin, 
ATP-binding cassette, sub-family C (CFTR 
eyes absent (Drosophila) homolog 1 
protein kinase C, mu 
adenylate kinase 5 
estrogen receptor 1 

ATPase, Ca++ transporting, type 2C, memb 
ATP-binding cassette, sub-family G (WHIT



Hs.1087 



Hs.57856 



Hs.73826 

Hs.18442 

Hs.31092 

Hs.2126 

Hs.204732 

Hs.306517 



ATPase, Cu++ transporting, alpha polypep

RAN binding protein 2 
arachidonate 15-lipoxygenase, second typ 
low density lipoprotein receptor-related 
deiodinase, iodothyronine, type II 
aminopeptidase puromycin sensitive 
mitogen-activated protein kinase kinase 
replication factor C (activator 1) 2 (40 
serine/threonine kinase 2 
purinergic receptor (family A group 5) 
PFTAIRE protein kinase 1 
chromosome 18 open reading frame 1 
protein tyrosine phosphatase, non-recept 
E-1 enzyme 
EphA5 

vasoactive intestinal peptide receptor 2 



phosphatidylserine-specific phospholipas lipase 



trypsin 9.0 

Peptidase_M10 8.7 

ABC_tran,ABC_membrane 7.6 

Hydrolase 7.2 

pkinase,DAG_PE-bind,PH 6.4 

adenylatekinase 6.0 

Oest_recep,zf-C4,hormone_rec 5.7

E1-E2_ATPase,Hydrolase 5.3

ABC_tran 5.3

transferrin,7tm_1 5.2

E1-E2_ATPase,Hydrolase,HMA 5.0

trypsin 4.9
Ran_BP1,zf-RanBP,TPR,pro_isomerase

lipoxygenase,PLAT 4.8

ldl_recept_b,ldl_recept_a,EGF 4.8

T4_deiodinase 4.3

Peptidase_M1 4.2

pkinase 4.2

AAA,Viral_helicase1 4.2

pkinase 4.1

7tm_1 3.9

pkinase 3.9

ldl_recept_a 3.8

Y_phosphatase,Band_41,PDZ 3.7

Hydrolase 3.6

EPH_lbd,fn3,pkinase,SAM 3.6

7tm_2 3.6

Peptidase_M10 3.5

SPRY,7tm_1 3.5

A_deaminase 3.5 

Zn_carbOpept 3.4 
TABLE 8: 136 GENES SIGNIFICANTLY DOWN-REGULATED IN PROSTATE
CANCER COMPARED TO NORMAL PROSTATE

Table 8 shows 136 genes significantly down-regulated in prostate cancer compared to normal
prostate. These were selected from 59680 probesets on the Affymetrix/Eos Hu03 GeneChip
array such that the ratio of "average" normal prostate to "average" prostate cancer tissues was
greater than or equal to 2. The "average" normal prostate level was set to the mean amongst 4
normal prostate tissues. The "average" prostate cancer level was set to the 85th percentile
amongst 73 tumor samples. In order to remove gene-specific background levels of non-specific
hybridization, the 10th percentile value amongst all the tissues was subtracted from
both the numerator and the denominator before the ratio was evaluated.
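
The selection rule described for Table 8 can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the procedure actually used to build the table: the function names (`percentile`, `normal_to_tumor_ratio`, `is_down_regulated`), the linear-interpolation percentile, and the treatment of a non-positive denominator are hypothetical choices that the text does not specify.

```python
from statistics import mean


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation.

    Assumption: the text does not say which percentile definition was used.
    """
    ordered = sorted(values)
    if not ordered:
        raise ValueError("percentile() needs at least one value")
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def normal_to_tumor_ratio(normal, tumor, all_tissues):
    """Background-subtracted ratio of "average" normal to "average" tumor.

    normal      -- expression levels in the normal prostate samples (mean is used)
    tumor       -- expression levels in the tumor samples (85th percentile is used)
    all_tissues -- levels in all tissues (10th percentile is the background)
    """
    background = percentile(all_tissues, 10)
    numerator = mean(normal) - background
    denominator = percentile(tumor, 85) - background
    if denominator <= 0:
        # Assumption: the text does not describe this case; report "undefined".
        return float("nan")
    return numerator / denominator


def is_down_regulated(normal, tumor, all_tissues, threshold=2.0):
    """True when the ratio meets the cutoff stated for Table 8 (>= 2)."""
    return normal_to_tumor_ratio(normal, tumor, all_tissues) >= threshold
```

With made-up levels such as `normal = [100, 120, 110, 90]` and `tumor = [20, 30, 40, 50, 60]`, and all nine values taken as the tissue panel, the background is 28, the numerator is 105 - 28 = 77, the denominator is 54 - 28 = 26, and the ratio of about 2.96 passes the cutoff of 2.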

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title 

R1: Ratio of normal prostate to prostate cancer 

Pkey ExAccn UnigeneID Unigene Title R1

425932 M81650 Hs.1968 semenogelin I 57.69 

425545 N98529 Hs.158295 Human mRNA for myosin light chain 3 (MLC 19.70 

426752 X69490 Hs.172004 titin 15.25 

442082 R41823 Hs.7413 ESTs; calsyntenin-2 10.05 

407245 X90568 Hs.172004 titin 9.38 

422711 D60641 Hs.21739 Homo sapiens mRNA; cDNA DKFZp586l1518 (f 9.05 

420813 X51501 Hs.99949 prolactin-induced protein 8.18 

411987 AA375975 Hs.183380 "ESTs, Moderately similar to ALU7_HUMAN 7.45 

404567 5.62 

416030 H15261 Hs.21948 ESTs 5.51 

444892 AI620617 Hs.148565 ESTs 5.27

444573 AW043590 Hs.225023 ESTs 5.20

428068 AW016437 Hs.233462 ESTs 5.08

437440 AA846804 Hs.123694 ESTs 4.95

404113 4.75 

452279 AA286844 Hs.61260 hypothetical protein FLJ13164 4.75

421058 AW297967 Hs.188181 ESTs 4.63 

445592 AV654382 Hs.17947 "ESTs, Weakly similar to K02F3.10 [Cele 4.53 

405163 4.49 

405227 4.45 

454059 NM_003154 Hs.37048 statherin 4.45

450152 AI138635 Hs.22968 ESTs 4.40 

407013 U35637 "gb:Human nebulin mRNA, partial cds" 4.03 

403612 4.02 

440089 AA864438 Hs.135646 ESTs 4.00 

408988 AL119844 Hs.49476 Homo sapiens clone TUA8 Cri-du-chat regi 3.98

438726 AA324975 Hs.128993 "ESTs, Weakly similar to KIAA0465 protei 3.95

459367 BE148877 "gb:CM4-HT0244-111199-040-h12 HT0244 Hom 3.95

427318 AF186081 Hs.175783 zinc transporter 3.92 

411762 AW860972 "gb:QV<M5T0387-180300-167-h07CT0387Hom 3.85 

418658 AW407987 Hs.87150 Human clone A9A2BR11 (CAC)n/(GTG)n repea 3.75 

458311 AF069478 "gb:AF069478 Homo sapiens astrocytoma li 3.61 

403649 3.60 

419682 H13139 Hs.92282 paired-like homeodomain transcription fa 3.58 

412519 AA196241 Hs.73980 "troponin T1, skeletal, slow" 3.51 

414206 AW276887 Hs.46609 ESTs 3.45 

427419 NM_000200 Hs.177888 histatin 3 3.37

420777 AA280223 Hs.130865 ESTs 3.35

428134 AA421773 Hs.161008 ESTs 3.31

450218 R02018 Hs.168640 "Ank, mouse, homolog of 3.30

433474 AI192195 Hs.147174 "EST, Highly similar to ubiquitin-protei 3.30 

418833 AW974899 Hs.292776 ESTs 3.26 

400440 X83957 Hs.83870 nebulin 3.16 
413778 AA090235 Hs.75535 "myosin, light polypeptide 2, regulatory

423151 AW838068 "gb:QV3-LT0048-010300-109-f02 LT0048 Hom

445060 AA830811 Hs.88808 ESTs

457055 AI476318 Hs.192480 ESTs

4324S6 H00093 "gb:ph8f12u_19/'1TV Outward Alu-primed hn

405678

406707 S73840 Hs.931 "myosin, heavy polypeptide 2, skeletal m

444105 AW189097 Hs.166597 ESTs

433968 AL157518 Hs.90421 PR02463 protein 

438522 AA809431 Hs.258886 ESTs 

436562 H71937 Hs.169756 "complement component 1, s subcomponent"

412417 AA10226B Hs.42175 ESTs

455590 BE072259 "gb:QV4-BT0536-271299-059-g04 BT0536 Hom

415380 F07953 Hs.160S5 putative G-protein coupled receptor

428729 AL162331 Hs.191436 hypothetical protein FLJ10619

408537 AW207734 "gb:UI-H-BI2-age-h-01-0-Ul.s1 NCI_CGAP_S 

424706 AA741336 Hs.152108 transcriptional unit N143 

413212 BE072092 "gb:PM4-BT0532-160200-003-b11 BT0532 Hom

406704 M21665 Hs.929 "myosin, heavy polypeptide 7, cardiac mu 



410384 AI933794 Hs.42745 ESTs 

408074 R20723 Hs.124764 ESTs 

436653 AA829828 Hs.292402 ESTs 

458090 AI282149 Hs.56213 "ESTs, Highly similar to FXD3_HUMAN FORK

432003 AI689154 Hs.122972 ESTs

436915 AA737400 Hs.142230 ESTs 

410028 AW576454 Hs.258553 ESTs 

448920 AW408009 Hs.22580 alkylglycerone phosphate synthase 

422046 AI638562 "gb:ts50a10jc1 NCI_CGAP_Ut1 Homo sapiens 

451122 AA015767 Hs.193587 ESTs 

422646 H87863 Hs.151380 ESTs 

451237 AW600293 "gb:ESTO0O49 pGEM-T library Homo sapiens 

400001 AFFX control: BioB-3 

415835 Z45365 "gb:HSC2NF061 normalized infant brain cD

439706 AW872527 Hs.59761 ESTs

423341 AW242394 Hs.252495 ESTs

436486 AA742221 Hs.120633 ESTs

407449 AJ002784 gb:Homo sapiens mRNA; fetal brain cDNA 5

430573 AA744550 Hs.136345 ESTs

401974

443356 AL044498 Hs.133262 "ESTs, Weakly similar to PH0217 reverse

430751 NM_012471 Hs.247868 transient receptor potential channel 5

439128 AI949371 Hs.153089 ESTs

448765 R15337 Hs.21958 "Homo sapiens cDNA FLJ10532 fis, clone N

451130 AI762250 Hs.211347 ESTs 

405420 

455029 AW851258 "gb:IL3-CT0220-160200-066-H06 CT0220 Hom

438224 AA933999 "gb:on91f04.s1 Soares_Na_T_GBC_S1 Homo

407764 BE008347 "gb:CM0-BN0154-080400-325-h04 BN0154 Hom

413549 BE252470 "gb:601108292F1 NIH_MGC_16 Homo sapiens

437010 AA741368 Hs.291434 ESTs 

435111 AI914279 Hs.213740 ESTs 
403375 

455060 AW853441 "gb:RC1-CT0252-030100-023-g05 CT0252 Hom

409792 AW854153 "gb:RC3-CT0254-060400-029-d03 CT0254 Hom

421154 AA284333 Hs.287631 "Homo sapiens cDNA FLJ14269 fis, clone P
401963 

435034 AF168711 Hs.159397 x 010 protein 

448996 AW998989 Hs.105749 KIAA0553 protein 

436816 AW297599 Hs.255667 ESTs 

442252 AI733395 Hs.129124 ESTs 

419310 AA236233 Hs.188716 ESTs 

418579 H91800 Hs.124156 ESTs 

423315 R54109 Hs.26096 ESTs 

432744 AA988835 Hs.38664 ESTs 

424492 AI133482 Hs.165210 ESTs 

424770 AA425562 "gb:zw46e05.r1 Soares_total_fetus_Nb2HF8

437101 AA744518 Hs.120610 ESTs

428793 AC004957 Hs.298975 "ESTs, Highly similar to collapsin-2-lik
415708 H56475 "gb:yt87d11.r1 Soares_pineal_gland_N3HPG 2.13

459619 2.12 

427506 AK000134 Hs.179100 hypothetical protein FU20127 2.12 

452508 M804174 Hs.184354 ESTs 2.10

410881 AW809157 "gb:RC0-ST01 18-041099-031-C07J ST01 1 8 Homo sapiens cDNA, mRNA sequence" 2.10 

403087 2.10 

403869 2.10 

445028 D81194 Hs.282499 ESTs 2.10 

447884 H29505 "gb:ym60d10.r1 Soares infant brain 1NIB Homo sapiens cDNA clone 5', mRNA sequence" 2.10 

414575 H11257 Hs.295233 ESTs 2.09 

420351 BE218221 Hs.190044 ESTs 2.08 

"gb:601121068F1 NIH_MGC_2Q Homo sapiens cDNA clone S\ mRNA sequence" 2.08 

2.08 



427046 BE246180 Hs.121385 ESTs 
403493 

444514 AI682905 Hs.270431 "ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION WARNING ENTRY [H.sapiens]"
435884 AA701443 Hs.192868 ESTs 
419629 AB020695 Hs.91662 KIAA0888 protein 
405900 

457350 AW974438 Hs.194136 "ESTs, Moderately similar to AF091457 1 zinc finger protein RIN ZF [R.norvegicus]"
400007 AFFX control: BioDn-5

406978 M64358 "gb:Human rhom-3 gene, exon."
TABLE 8A shows the accession numbers for those primekeys lacking a UnigeneID in Table
8. For each probeset we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.



Pkey: Unique Eos probeset identifier number

CAT number: Gene cluster number

Accession: Genbank accession numbers

Pkey CAT number Accession



407764 1014849_1

408537 1064753_1

409792 1154677_1

410881 1225682_1

411762 1256906_1

413212 1353792_1

413549 1375933_2

415708 1548209_1

415835 1558511_1

422046 210744_1

423151 225415_1

423843 232510_1

424770 243504_1

426998 274259_-1 

3 347718_2 



AW207734 D60164 D81150 D81078 D61356 AW996804 

AW854153 AW500210 BE145772 AW501310 

AW809157 AW812181 AW812175 AW812172 AW812161 AW812165

AW860972 AW862598 AW862599 AW860988 AW860983 AW860898 AW860925 AW860922 AW860986 AW860984 AW860989

BE072092 BE072106 BE072086 BE072098 BE072103

BE252470 BE147573

H56475 F29401 F34552 

Z45365 R25905 H05203 T77496 

i H13401 F07773 R55836 



AA332652 AA331633 AW999369 AW902993 BE170475 AA378845 AW9S41 75 AI475221 

AA425562 AI880208 AA346646 N22655 AW81 1775 AW81 1786 

BE2743S0 

H00093 H00079 H00070 H00054 H00049 H00063 AW905306 AW905241 AW905410 AW905307 AW905411 

AW905352 AW905304 AW905239 AW905242 AW905243 H00087 
AA933999 AA781181

H29505 R18575 243580 T48738 AI435454 BE004683
AW600293 AI767468

AW851258 AW851435 AW851106 AW851421 
AW853441 BE145228 BE145218 BE145162 BE145283 



455590 1335127_1 BE072259 BE072230 BE007911
458311 543550_1 AF069478 AF069479 AF069480
TABLE 8B shows the genomic positioning for those primekeys lacking UnigeneIDs and
accession numbers in Table 8. For each predicted exon, we have listed the genomic sequence
source used for prediction. Nucleotide locations of each predicted exon are also listed.

Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the
publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.

Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.
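
As a reading aid for the nucleotide-position column, the following Python sketch parses an entry such as "51382-51521" into start and end coordinates and derives the exon length. The helper names (`parse_nt_position`, `exon_length`) are hypothetical, and the sketch assumes that both endpoints are inclusive, which the legend does not state.

```python
def parse_nt_position(text):
    """Split a "start-end" entry into a pair of integers.

    Assumption: entries always have the form <start>-<end> with start <= end.
    """
    start_text, end_text = text.strip().split("-")
    start, end = int(start_text), int(end_text)
    if start > end:
        raise ValueError(f"start exceeds end in {text!r}")
    return start, end


def exon_length(text):
    """Length of the predicted exon, assuming inclusive coordinates."""
    start, end = parse_nt_position(text)
    return end - start + 1
```

Under the inclusive-coordinate assumption, the entry "51382-51521" listed for Pkey 401983 would correspond to a predicted exon of 140 nucleotides.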



Pkey 


Ref 


Strand 


Nt_position


401983 


3126783 


Plus 


51382-51521 


401974 


3126777 


Plus 




403087 


8954241 


Plus 


169511-169795 


403375 
403493 


9255944 
7341425 


Minus 
Plus 


92554-92795 

157568-159084 

94723-94859 


403612 
403649 


8469060 
8705159 


Minus 
Minus 


27141-27247 




7280046 


Minus 


34379-34583 


404113 


9588571 


Minus 


13446-13646 


404567 


7249169 




101320-101501 


405163 


9966267 


Minus 


161171-161299 


405227 


6731245 


Minus 


22550-22802 


405420 


7211837 


Minus 


13428-13582 


405455 


7656675 


Plus 


134112-134671 


405678 


4079670 


Plus 


151821-152027 


405900 


6758795 


Minus 


71181-71535 


406135 


9164918 


Minus 


65489-65715 
TABLE 9: 1001 GENES SIGNIFICANTLY UP-REGULATED IN NORMAL PROSTATE
COMPARED TO PROSTATE CANCER

Table 9 shows 1001 genes significantly up-regulated in prostate cancer compared to normal
prostate. These were selected from 59680 probesets on the Affymetrix/Eos Hu03 GeneChip
array such that the ratio of "average" normal prostate to "average" prostate cancer tissues was
greater than or equal to 8.14. The "average" normal prostate level was set to the mean
amongst 4 normal prostate tissues. The "average" prostate cancer level was set to the 85th
percentile amongst 73 tumor samples. In order to remove gene-specific background levels of
non-specific hybridization, the 10th percentile value amongst all the tissues was subtracted
from both the numerator and the denominator before the ratio was evaluated.



Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: Ratio of prostate cancer to normal prostate

Pkey ExAccn UnigeneID Unigene Title R1




451002 AA013299 Hs.8018 ESTs, Weakly similar to ALU3_HUMAN ALU S 1684.00
435596 AA689465 Hs.188999 ESTs 738.00
443576 AI078027 Hs.169338 ESTs 246.86
434247 AA928116 Hs.272065 ESTs 245.20
400452 AK000185 gb:Homo sapiens cDNA FLJ20178 fis, clone 222.00
405932 221.33
427906 AA864330 Hs.16S520 ESTs 212.00
443685 AI686550 Hs.174481 ESTs 163.20
451554 AI474866 Hs.193237 ESTs 149.45
418323 NM_002118 Hs.1162 major histocompatibility complex, class 126.11
429480 M36860 Hs.9295 elastin (supravalvular aortic stenosis,
426025 AW138330 Hs.233778 ESTs 120.00
418917 X02994 Hs.1217 adenosine deaminase 106.75
404407 105.71
442027 AI652926 Hs.128395 ESTs 100.53
433704 AA608684 Hs.121705 ESTs, Moderately similar to ALUC_HUMAN ! 94.00
453758 U83527 gb:HSU83527 Human fetal brain (Mlovett) 89.18
415354 F06495 gb:HSC1AB051 normalized infant brain cDN 87.73
424239 M67439 Hs.143526 dopamine receptor D5 86.82
444143 AW747996 Hs.160999 ESTs 86.43
401672 77.23
430590 AW383947 Hs.246381 CD68 antigen 68.47
411972 BE074959 gb:PMO-BT0582-310100-001-f08 BT0582 Homo 68.00
448992 AI766053 Hs.188346 ESTs 61.26
408828 BE540279 gb:601059857F1 NIH_MGC_10 Homo sapiens c 57.71
409653 AW451693 Hs.220826 ESTs 56.40
402964 54.67
422673 N59027 gb:yv59d11.r1 Soares fetal liver spleen 54.00
422568 AA372275 Hs.279800 Homo sapiens cDNA FLJ11383 fis, clone HE 54.00
438907 R32704 Hs.301298 ESTs 52.96
405172 52.98
444897 AW137088 Hs.144857 ESTs 52.32
458019 AW592931 Hs.256298 ESTs 51.63
405275 AB028989 Hs.88500 mitogen-activated protein kinase 8 inter 50.98
457815 AA703679 Hs.106999 ESTs, Weakly similar to SYT5_HUMAN SYNAP 49.60
424385 AA339666 gb:EST44776 Fetal brain I Homo sapiens c 48.90
407172 T54095 gb:ya92c05.s1 Stratagene placenta (93722 47.98
428202 AA424163 Hs.156895 ESTs 46.83
435672 AI700148 Hs.283626 ESTs 43.57
420283 AA485224 43.00
417016 AA837098 HS569933 ESTs 42.70
438854 AF074994 Hs.24240 ESTs 42.67
406134

457319 AA480895 

409314 AA070266 
401124 

429316 AI371157 

420317 AB006628 



417407 AA923278 

430269 BE221682 

10 439602 W79114 

433686 M604799 



428214 AA936282 



415911 H08796 

457502 AA076049 

421566 NM_000399 
401468 

458561 AI220150 



Hs.201552 ESTs, Weakly similar to T17288 hypotheti 
gb:zm69d04.r1 Stratagene neuroepithalium 

Hs.1 78538 ESTs 

Hs.96485 KIAA0290 protein 

gb:MRO-CT0060-120899-001-f08 CT0060 Homo 

Hs.290905 ESTs, Weakly similar to protease [H.sapi 

Hs.178364 ESTs 

HS.5B558 ESTs 
Hs.1 36528 



coagulation factor XIII, A1 polypeptide 
hypothetical protein FU10257 
ESTs 

Homo sapiens cDNA FU10229 fis, clone HE 
early growth response 2 (Krox-20 (Drosop 



454977 AW848032 
402828 

414522 AW518944 
402842 

421245 AA2S5363 

401631 F05183 

408057 AW139565 

408069 H81795 

438694 T87479 

449156 AF103907 

428796 AU076734 

452549 AI907039 

410129 BE244074 

414464 AI870175 



Hs.211195 ESTs 

Hs.123993 ESTs, Weakly similar to T00366 hypotheti 

gb:lL3-CT0214-231299-053-D1 1 CT0214 Homo 

Hs.76325 Homo sapiens cDNA: FU231 25 fis, clone L 

gb:HTH280 HTCDL1 Homo sapiens cDN A 573' 
Hs.1799 CD1D antigen, d polypeptide 

gb:U l-H-Bd-aea-d-04-O-U I.s1 NCI_CGAP_Su 
gb:ys68a10.r1 Soares retina N2b4HR Homo 



Hs.291797 
Hs.171353 
Hs. 193665 



solute carrier family 28 (sodium-coupled 
gb:PM-BT1 34-020499-566 BT134 Homo sapien 
regulator of Fas-induced apoptosis 



Small inducible cytokine A3 (homologous 
gb:zb03a12.r1 Scares JetalJung_NbHL19W 
4 ESTs 

single-minded (Drosophila) homolog 2 
443412 W84893 ms.9305 angiotensin receptcr-like 1 
457324 AB028990 Hs543901 KIAA1067 protein 

Hs.234734 lysozyme (renal amyloidosis) 
Hs.178140 ESTs 

Hs.128749 alpha-methylacyl-CoAracemase 
Hs.123426 ESTs, Weakly similar to putative serine/ 
Hs.75765 GR02 oncogene 



459081 W07808 
448702 AW1O2670 
451939 U80456 



424247 X14008 
457140 AI279960 
444151 AW972917 
45 457669 AW104257 



38.73 
38.06 
36.69 
3559 
36.18 
36.10 
36.08 



31.59 
3156 
3154 
3150 
31.09 
29.78 
29.76 
29.59 
29.53 
29.47 
2952 



2B54 
28.18 
28.12 



407997 AW135429 Hs.243577 ESTs 26.96 

442115 AW452332 Hs557554 ESTs 26.36

409038 T97490 Hs.50002 small inducible cytokine subfamily A (Cy 26.34

402838 26.32
449846 AI979284 HS500552 ESTs 2651

417153 X57010 Hs.81343 collagen, type II, alpha 1 (primary oste 2650

439792 NM_014856 Hs.6684 KIAA0476 gene product 25.91

450096 AI682088 Hs.223368 ESTs 25.60

424196 AL133660 Hs.142926 Homo sapiens mRNA; cDNA DKFZp434M0927 (f 25.57

414246 BE391090 Hs.280278 EST 25.57

420848 NM_005188 Hs.99980 Cas-Br-M (murine) ecotropic retroviral t 25.48

424778 AA251048 Hs.153042 lymphocyte antigen 9 25.42

409126 AA063426 gb:zf70c08.s1 Soares_pineal_gland_N3HPG 2555

Hs.31196 ESTs 2552 

gbSlflO Human retina cDNA randomly prim 25.01 

Hs.8509 ESTs, Weakly similar to C03_HUMAN COMPLE 24.85 

gb:RC1-BTO606-O9050O-O15-b04 BTO606 Homo 24.76 

Hs.153023 ESTs 24.74 

Hs.1 31770 ESTs 24.57 

Hs.1 1 900 Human DNA sequence from clone RP4-583P1 5 24.53 

Hs.105700 secreted frizzled-related protein 4 24.49 
443668 U25758 Hs.134584 ESTs 24.49

424800 AL035588 Hs.153203 MyoD family inhibitor 24.10 

453633 AA357001 Hs.34045 hypothetical protein FU20764 24.04 

430565 AL122081 Hs.244343 cadherin related 23 24.00 

433694 AI208611 Hs.12066 Homo sapiens cDNA FLJ11720 fis, clone HE 23.89

451045 AA215672 gb:zr96e09.s1 NCI_CGAP_GCB1 Homo sapiens 23.83

408583 AW449674 Hs.47359 ESTs 23.73

444040 AF204231 Hs.182982 golgin-67 23.62

414182 AA136301 gb:zk93g04.s1 Soares_pregnant_uterus_NbH 23.39

418678 NM_001327 Hs.167379 cancer/testis antigen 23.20

408380 AF123050 Hs.44532 diubiquitin 22.68 

456076 BE243877 Hs.76941 ATPase, Na+/K+ transporting, beta 3 poly 22.65 

418299 AA279530 Hs.83968 integrin, beta 2 (antigen CD18 (p95), ly 22.38

444917 R68651 Hs.144997 ESTs 22.26

444331 BE387335 Hs.283713 ESTs 22.08

415788 AW628686 Hs.78851 KIAA0217 protein 22.04

410896 AW809637 gb:MR4-ST0124-261099-015-b07 ST0124 Homo 22.00

412978 AI431708 Hs.820 homeobox C6 21.95

458418 AV653846 Hs.126261 Homo sapiens Chromosome 16 BAC clone CIT 21.94

454791 BE071874 gb:RC2-BT0522-120200-014-a06 BT0522 Homo 21.84

408748 J05500 Hs.47431 spectrin, beta, erythrocytic (includes s 21.26

416011 H14487 gb:ym18c10.r1 Soares infant brain 1NIB H 21.24

440474 AI207936 Hs.7195 gamma-aminobutyric acid (GABA) A recepto 21.14 

447047 AI623698 Hs.246306 Homo sapiens cDNA: FU23529 fis, clone L 21.11 

426793 X89887 Hs.172350 HIR (histone cell cycle regulation defec 21.10

409841 AW502139 gb:UI-HF-BR0p-ajr-e-Q5-0-Ul.r1 NIH_MGC_5 21.07

405685 20.90

457359 AI983207 Hs.192481 ESTs, Weakly similar to SYPH_HUMAN SYNAP 20.84

423037 AA321355 Hs.285401 ESTs 20.74

422355 AW403724 Hs.140 immunoglobulin heavy constant gamma 3 (G 20.73

401201 20.73 

458278 W28912 Hs.129019 ESTs 20.68 

439097 H66948 gb:yr86d10.M Soares fetal liver spleen 20.67 

414875 H42679 Hs.77522 major histocompatibility complex, class 20.66 

400926 20.66

451355 NM_004197 Hs.444 serine/threonine kinase 19 20.64

446982 AW500221 Hs.43616 Homo sapiens mRNA for FLJ00029 protein, 20.61

417105 X60992 Hs.81226 CD6 antigen 20.61

405777 20.51

424123 AW966158 Hs.58582 Homo sapiens cDNA FLJ12702 fis, clone NT 20.20

425009 X58288 Hs.154151 protein tyrosine phosphatase, receptor t 20.10 

443271 BE568568 Hs.195704 ESTs 19.98 

421064 AI245432 Hs.101382 tumor necrosis factor, alpha-induced pro 19.98 

418819 AA228776 Hs.191721 ESTs 19.94

457595 AA584854 gb:no09h11.s1 NCI_CGAP_Phe1 Homo sapiens 19.90

404426 19.84

412571 U43143 Hs.74049 fms-related tyrosine kinase 4 19.79

431457 NM_012211 Hs.256297 integrin, alpha 11 19.62

414002 NM_006732 Hs.75678 FBJ murine osteosarcoma viral oncogene h 19.57

418994 AA296520 Hs.89546 Selectin E (endothelial adhesion molecul 19.56

437158 AW090198 Hs.4779 KIAA1150 protein 19.52

437866 AA156781 Hs.83992 ESTs 19.44

417421 AL138201 Hs.82120 nuclear receptor subfamily 4, group A, m 19.34

433057 X15675 Hs.296832 Human pTR7 mRNA for repetitive sequence 19.22

421730 AW449808 Hs.164036 glucosamine (N-acetyl)-6-sulfatase (Sanf 19.21

456557 AA284477 Hs.96618 ESTs 18.77 

440806 AI247422 Hs.129966 ESTs 18.76 

439845 AL355743 Hs.56663 Homo sapiens EST from clone 41214, full 18.65 

416155 AI807264 Hs.205442 ESTs, Weakly similar to AF117610 1 inner 18.64

437820 AA769062 Hs.16029 ESTs, Weakly similar to alternatively sp 18.62

450923 AW043951 Hs.38449 ESTs 18.59

418329 AW247430 Hs.84152 cystathionine-beta-synthase 18.58

424537 AI673027 Hs.143271 ESTs 18.55

447742 AF113925 Hs.19405 caspase recruitment domain 4 18.52

415251 R42863 Hs.7124 ESTs 18.47

440770 AA912815 Hs.222078 ESTs 18.40

407711 AI085846 Hs.25522 ESTs 18.32

427157 U51166 Hs.173824 thymine-DNA glycosylase 18.28

409847 AW501751 Hs.279733 ESTs 18.15 
431735
401515 
444045 
442754 



Hs.176028 EST 

Hs.123136 leucine rich repeat and death domain con 

Hs.278615 ESTs 

Hs.276770 CDW52 antigen (CAMPATH-1 antigen) 

Hs.293684 ESTs, Weakly similar to alternatively sp 

Hs.227209 DKFZP586F1019 protein 

Hs.195663 ESTs

Hs.3618 hippocalcin-like 1 

AW977724 Hs.75968 thymosin, beta 4, X chromosome 



5 NS0866 

0 AI971131 

4 AL050102 

4 AI839114 



AI097439 Hs.135548 ESTs

AL045825 Hs.210197 ESTs 

AB001914 Hs.170414 paired basic amino acid cleaving system 

432415 T16971 Hs.289014 ESTs 

15 427829 AI188225 Hs.127462 ESTs 

432516 R08003 Hs.188013 ESTs 

435259 AA152106 Hs.4859 cyclin L ania-6a 
414989 T81668 gb:yd29c04.r1 Soares fetal liver spleen 

444880 AW118683 Hs.154150 ESTs

20 417651 R06874 Hs.268628 ESTs 

453457 AL037103 Hs.270599 ESTs, Weakly similar to unnamed protein 
Hs.143604 Kaiso 



419078 
417696 
25 431117 
455254 



AW877015 
U66468 
H0817O 



Hs.82401 CD69 antigen (p60, early T-cell activati
Hs.250500 delta (Drosophila)-like 1

gb:QV2-PT0010-250300-096-f12 PT0010 Homo
Hs.159525 cell growth regulatory with EF-hand doma



426403 NM_000361 Hs.2030 thrombomodulin 

425905 AB032959 Hs.161700 KIAA1133 protein

438887 AW451157 Hs.181157 ESTs 

420940 AA830664 Hs.143974 ESTs 

459234 AI940425 gb:CMO-CT0052-150799-024-c04 CT0052 Homo 



422247 U18244 

420568 F09247 

443559 AI076765 

438703 AI803373 

411424 AW845985 



Hs.113602 solute carrier family 1 (high affinity a
Hs.167399 protocadherin alpha 5
Hs.269899 ESTs 
Hs.31599 ESTs 

gb:RC2-CT0163-200999-002-H08 CT0163 Homo 



45 407811 / 
410721 F 



418277 AW135221

410688 AW796342 

420120 AL049610 

429597 NM_003816 

447033 AI357412 



Hs.113131 

Hs.217953 

Hs.21355 

Hs.153065 

Hs.40098 

Hs.2730 

Hs.5460 

Hs.180450
Hs.2860
Hs.124565
Hs.130812

Hs.95243 
Hs.2442 
Hs.157601 
Hs.106768 



5,10-methenyltetrahydrofolate synthetase 
ESTs, Moderately similar to NK-TUMOR REC 
doublecortin and CaM kinase-like 1 
ESTs 

cysteine knot superfamily 1 , BMP antagon 



416208 AW291168 
410708 AA534370 
447342 AI199268 
454563 AW807530 
411507 AW850140 
438170 AI916685 
416292 AA179233 



KIAA0776 protein 

gb:zf49g04.r1 Soares retina N2b4HR Homo 

ribosomal protein S24 

POU domain, class 5, transcription facto 

ESTs 

ESTs 

gb:PM2-UM0027-230200-002-h02 UM0027 Homo 

transcription elongation factor A (Sll)- 

a disintegrin and metalloproteinase doma 

EST-notinUniGene 

hypothetical protein FU10511 

ESTs 

hypothetical protein FU20001 

gb:7B02B10 Chromosome 7 Fetal Brain cDNA 



Hs.166468 programmed cell death 5
Hs.41295 ESTs

Hs.154088 Homo sapiens cDNA: FLJ22756 fis, clone K
Hs.19322 ESTs; Weakly similar to !!!! ALU SUBFAMI

gb:CM0-ST0081-130999-054-d02 ST0081 Homo
gb:IL3-CT0219-261099-023-D11 CT0219 Homo
Hs.194601 ESTs 

Hs.42390 nasopharyngeal carcinoma susceptibility 



18.12 
17.90 
17.82 
17.82 
17.75 
17.72 
17.71 
17.67 
17.58 
17.55 
17.54 
17.50 
17.50 
17.44 
17.36 
17.31 



17.14 
17.14 
17.12 
17.12 
17.01 
17.00 
16.98 
16.94 
16.92 
16.91 



16.50 
16.50 
16.40 
16.32 
16.30 
16.28
16.20
16.09 
16.04 
16.04 
16.02 
16.02 
15.94 
15.93 
15.86 
15.85 
15.84 
15.84 
15.43 
15.42 
15.38 
15.37 
15.36 
15.29
446686 AW138043



434485 
441188 
5 444172 



BE14774Q 
BE244854 
AA279956 
AA410506 
AB023185 



BE177778 

AI239832 
AW006783 

M732480 



424240 
10 451118 

437495 

445467 

418305 

402812 
15 436851 

400991 

415752 BE314524 

429900 AA460421 

403683 

20 430315 NM_004293 
451952 AL120173 
424687 J05070 
447229 BE617135 



Hs.255609 ESTs 

Hs.104558 ESTs 

Hs.159578 Homo sapiens mRNA for FLJ00020 protein,

Hs.88672 ESTs

Hs.118578 H.sapiens mRNA for ribosomal protein L18

Hs.143535 calcium/calmodulin-dependent protein kin 

Hs.60640 ESTs 

gb:RC1-HT0598-31030C-012-f07 HT0598 Homo 

Hs.15617 ESTs, Weakly similar to ALU4_HUMAN ALU S 

Hs.6686 ESTs 

Hs.293581 ESTs 



25 448553 AI638449 
431039 BE041395 
459145 AI903354 



Hs.239147 gi 
Hs.301663 ESTs 

Hs.151738 matrix metalloproteinase 9 (gelatinase B 

gb:601441677F1 NIH_MGC_65 Homo sapiens o 
Hs.159581 matrix metalloproteinase 17 (membrane-in 
Hs.173031 ESTs 

Hs.283676 ESTs, Weakly similar to unknown protein 

gb:RC-BT029-1 001 99-1 17 BT029 Homo sapien 
Hs.297647 ESTs, Moderately similar to calcium chan 



445885 AI734009 
407938 AA905097 
431676 AI685464 
437210 AA311443 
451900 AB023199 Hs.27207 
445800 AA126419 Hs.301632 
412368 AW945992 Hs.181125 
Hs.300578 
Hs.301526 
Hs.16074 
Hs.75425 
Hs.109439 



EST cluster (not in UniGene)



40 413551 BE242639 
421913 AI934365 
452712 AW838616 
451458 AW503398 
406038 Y14443 

45 424909 S78187 

434078 AW880709 
415254 AI815331 
418196 AI745649 
410020 T86315 

50 411352 NM_002890 
429848 AF145439 
413729 
400125 



448272 AI479094 

422695 AA315158 

424565 AW1 02723 

458048 H30340 

408894 AI935400 

454093 AW860158 

410889 XS1662 

457751 AI908236 

455131 AW857913 



ESTs 

Homo sapiens mRNA; cDNA DKFZp586E2317 (f 

KIAA0982 protein

ESTs 

lambda locus 



Homo sapiens cDNA FLJ13181 fis, clone NT
Homo sapiens mRNA; cDNA DKFZp564I153 (fr
ubiquitin associated protein 
osteoglycin (osteoinductive factor, mime 
gb:RC5-LT0054-14020<M)l3-D01 LT0054Homo 
ESTs 

zinc finger protein 200 
cell division cycle 25B 
EST 
ESTs 

ESTs, Weakly similar to T00066 hypotheti 
ribonuclease, RNase A family, 2 (liver,
RAS p21 protein activator (GTPase activa
chemokine (C-C motif) receptor 9 
gb:QV1-HT0412-270300-123-d10 HT0412 Homo 



Hs.96593 hypothetical protein 
Hs.170786 ESTs 

gb:EST186956 HCC cell line (matastasis t 
Hs.75295 guanylate cyclase 1 , soluble, alpha 3 
Hs.173705 Homo sapiens cDNA: FLJ22050 fis, clone H
Hs517286 ESTs

gb:RC0-CT0379-290100-032-b04 CT0379 Homo
Hs.66744 twist (Drosophila) homolog (acrocephalos

gb:IL-BT166-180399-010 BT166 Homo sapien

gb:RC0-CT0323-231199-031-b05 CT0323 Homo
Hs.128453 ESTs
Hs.155965 ESTs



Hs.210047 
Hs.88219 
Hs.153752 



Hs.728 
Hs.758 
Hs525946 



15.26
15.25
15.24
15.22
15.22
15.16 
15.14 
15.14 
15.12 
15.12 
15.12 
15.06 



15.00 
14.96 
14.90 
14.84 
14.80 
14.72 
14.69 
14.67 
14.65 
14.63 
14.60 
14.55 
14.54 
14.46 
14.44 
14.42 
14.40 
14.36 
14.35 
14.32 
14.31 
14.23
14.22
14.22
14.22
14.22
14.22
14.16 
14.14 
14.07 
14.07 
14.05 
14.02 
13.98 
13.98 
13.95 
13.90 
13.88 
13.85 
13.80 
13.80 
13.78 
13.78 
13.76 
13.75 
13.74 
13.72 
13.69 
13.67 
13.62 
13.60 
13.53 
13.53 
13.49 
430685 AI690234 Hs.191666 ESTs, Weakly similar to reverse transcri 13.47

414052 AW578849 Hs.283552 ESTs, Weakly similar to unnamed protein 13.46

447858 AW080339 Hs.211911 ESTs 13.44

435716 AI573283 Hs.38458 ESTs 13.44

439120 H56389 gb:yt87c03.r1 Soares_pineal_gland_N3HPG 13.43

402788 13.40

451591 AA886446 Hs.146278 ESTs 13.40

405411 13.38

426558 AW188574 Hs.24218 ESTs 13.34

453506 AA132818 Hs.110407 ESTs, Weakly similar to coded for by C. 13.33

416445 AL043004 Hs.300678 Human serine/threonine kinase mRNA, part 13.32

457084 AI074149 Hs.150905 ESTs, Weakly similar to chondroitin 4-su 13.32

403838 13.32

427337 Z46223 Hs.176663 Fc fragment of IgG, low affinity IIIb, r 13.30

434318 AW207552 Hs.116328 ESTs, Weakly similar to dJ134E15.1 [H.sa 13.28

435193 N41359 Hs.218107 ESTs 13.28

414756 AW451101 Hs.159489 ESTs, Moderately similar to hexokinase I 13.27

420626 AF043722 Hs.99491 RAS guanyl releasing protein 2 (calcium 13.26

420052 AA418850 Hs.44410 ESTs 13.25

414020 NM_002984 Hs.75703 small inducible cytokine A4 (homologous 13.25

403351 13.24

422647 W07492 Hs.157101 ESTs 13.21

433598 AI762836 Hs.271433 ESTs, Moderately similar to ALU2_HUMAN A 13.21

409085 AB033113 Hs.50187 KIAA1287 protein 13.20

435063 R21966 Hs.57734 G protein-coupled receptor kinase-intera 13.19

439367 BE386844 Hs.248746 ESTs 13.17

451957 AI796320 Hs.10299 Homo sapiens cDNA FLJ13545 fis, clone PL 13.16

420569 AA278362 Hs.289062 Homo sapiens cDNA FLJ12334 fis, clone MA 13.14

447883 BE262802 Hs.4909 dickkopf (Xenopus laevis) homolog 3 13.07

426490 NM_001621 Hs.170087 aryl hydrocarbon receptor 13.06

414789 AA155859 Hs.79708 ESTs 13.05

451418 BE387790 Hs.26369 ESTs 13.04

443494 T99719 Hs.270404 Homo sapiens cDNA: FLJ22389 fis, clone H 13.03

425878 AW964806 Hs.38085 ESTs, Weakly similar to putative glycine 13.02

431912 AI660552 Hs.154903 ESTs, Weakly similar to A56154 Abl subst 13.00

407122 H20276 Hs.31742 ESTs 13.00 

456491 AL137466 Hs.97277 Homo sapiens mRNA; cDNA DKFZp434H1322 (f 12.99 

448172 N75276 Hs.135904 ESTs 12.98 

452144 AA032197 Hs.102558 ESTs 12.96 

419953 BE267154 Hs.125752 ESTs 12.96

416182 NM_004354 Hs.79069 cyclin G2 12.94

451154 AA015879 Hs.33536 ESTs 12.93

412257 AW903830 gb:CM4-NN1037-250400-155-h04 NN1037 Homo 12.93

449784 AW161319 Hs.12915 ESTs 12.92

432695 D63480 Hs.278634 KIAA0146 protein 12.92

454105 NM_001259 Hs.38481 cyclin-dependent kinase 6 12.92

439093 AA534163 Hs.5476 serine protease inhibitor, Kazal type, 5 12.90

416098 H41324 Hs.31581 ESTs, Moderately similar to ST1B_HUMAN S 12.88

424897 D63216 Hs.153684 frizzled-related protein 12.88

414604 AU076649 Hs.76556 growth arrest and DNA-damage-inducible 3 12.88

414664 AA587775 Hs.66295 Homo sapiens HSPC311 mRNA, partial cds 12.84

452560 BE077084 gb:RC5-BT0603-220200-013-C07 BT0603 Homo 12.84

413869 NM_000378 Hs.75596 interleukin 2 receptor, beta 12.80

452359 BE167229 Hs.29206 Homo sapiens clone 24659 mRNA sequence 12.80

435886 BE265839 Hs.12126 hepatocellular carcinoma-associated anti 12.78

445230 U97018 Hs.12451 echinoderm microtubule-associated protei 12.78 

412226 W26786 gb:15d7 Human retina cDNA randomly prime 12.77 

446619 AU076643 Hs.313 secreted phosphoprotein 1 (osteopontin, 12.76 

447769 AW873704 Hs.48764 ESTs 12.76 

414478 AI306389 Hs.76240 adenylate kinase 1 12.76

425383 D83407 Hs.156007 Down syndrome critical region gene 1-lik 12.68 

450704 H85157 Hs.40696 ESTs 12.66 

405856 12.66 

412935 BE267045 Hs.75064 tubulin-specific chaperone c 12.65

402802 12.62

452588 AA889120 Hs.110637 Homeobox A10 12.62

419978 NM_001454 Hs.93974 forkhead box J1 12.62

403137 12.60

430226 BE245562 Hs.2551 adrenergic, beta-2-, receptor, surface 12.57
448076 AJ133123 Hs.20196 adenylate cyclase 9 12.56


450462 F07097 


Hs.300828 Homo sapiens mRM A full length insert cON 










409292 AA071051 


gb:zm58eQ5,s1 Stratagene fibroblast (937 




421540 AA767e69 


Hs. 10242 ESTs 


1247 


425840 AW978731 


Hs.301824 ESTs 




443181 AI039201 


Hs.54548 ESTs 




452436 BE077546 


Hs.31447 ESTs 




455183 AW9841 1 1 


gb:RCO-HNO0O7-1603OO-01 1-f09 HN0007 Homo 




432887 AI926047 


Hs.1 62859 ESTs 




410494 M36564 


Hs.64016 protein S (alpha) 






Hs.35598 ESTs 




451246 AW189232 


Hs.15995 ESTs ^ ^ 




432892 AL042615 






418982 AI348838 


Hs.1 3073 ESTs 




414516 AI307802 


Hs.279551 ESTs 




440134 BE410734 


gb:601301619F1 NIH MGC 21 Homo sapiens c 




443073 AL048542 


Hs.1 6291 ESTs 










454020 AW962845 


Hs.256527 ESTs 


12 24 


420077 AW512260 


Hs.87767 ESTs 




443337 AI984625 


Hs.9884 spindle pole body protein 




407519 X64979 


gb'H sapiens mRNA HTPCRX01 for olfactory 




435839 AF249744 






448552 AW973653 


Hs.201 04 hypothetical protein RJ00052 










451009 AA013140 


Hs.1 15707 ESTs 




423066 Y18264 


Hs 120171 ESTs 




439556 AI623752 


Hs.1 63603 ESTs 


12 16 


443062 N77999 


Hs.8963 Homo sapiens mRNA full length insert cDN 




445873 AA250970 


Hs.251946 Homo sapiens cDNA' FLJ23107 fls clone L 




453542 AW836724 


Hs 127699 ESTs ^' ^ ^ P 




440106 AA864968 






417605 AF006609 




1210 


440286 U29589 






420061 AW024937 


Hs.29410 ESTs 




458727 AI022813 






445407 AI222658 


Hs.221889 ESTs, Weakly similar to la costa [D mela 




418250 U29926 


Hs.83918 adenosine monophosphate deaminase (isofo 


1134 


414129 AI990287 


Hs.270798 ESTs 


11.93 


409799 D11928 






438461 AW075485 


Hs.28oQ49 phosphosenne aminotransferase 














ii in 


434217 AW014795 


Hs.23349 ESTs 




451533 NM_004657 


Hs.26530 serum deprivation response (phosphatidyl 




422423 AF283777 


Hs.1 16481 CD72 antigen 


1189 








423853 AB011537 


Hs.133466 slit (Drosophila) homobg 1 


ii a? 








406538 ^^^^ 


s.75909 KIAA0182 protein 


11 TO 


433253 AW450502 


Hs.24218 ESTs 




447397 BE247676 






451684 AF216751 


Hs.26813 CDA14^ 




416862 R23765 


Hs.23575 ESTs 




425770 NM_014363 Hs.159492 spastic ataxia of Charlevoix-Saguenay (s 11.72

428826 AL048842 Hs.194019 attractin 11.72

433037 NM_014158 Hs.279938 HSPC067 protein 11.72

447476 BE293466 Hs.20880 ESTs 11.72

452092 BE245374 Hs.27842 hypothetical protein FLJ11210 11.72

412922 M60721 Hs.74870 H2.0 (Drosophila)-like homeo box 1 11.72

401680 NM_005578 Hs.180398 LIM domain-containing preferred transloc 11.69

422576 BE548555 Hs.118554 CGI-83 protein 11.68

450203 AF097994 Hs.301528 L-kynurenine/alpha-aminoadipate aminotra 11.68

410531 AW752953 gb:QV0-CT0224-261099-035-g02 CT0224 Homo 11.67

425917 W28517 Hs.117167 Homo sapiens cDNA: FLJ23067 fis, clone L 11.66

418693 AI750878 Hs.87409 thrombospondin 1 11.64

400557 11.62
416188 BE157260






419047 AW952771 


Hs 90043 ESTs ^ ^ 




420441 AI986160 


Hs.88446 ESTs 


11 59 


409853 AW502327 


gb'UI-HF-BROp-aka-a-07-O-UI r1 NIH MGC 5 








434540 NM 016045 


H 5184 TH1 d h'l h loa 


11 55 


431449 M55994 






425928 S55736 


Hs 238852 ESTs Weakly similar to hypothetical pro 




434701 AA460479 


Hs.4096 KIAA0742 protein 






Hs 283978 ESTs* KIAA0738 gene product 


11 52 


420729 AW964897 






428328 AA426080 


Hs.98489 ESTs 




433887 AW204232 


Hs.279522 ESTs 


11 50 


414812 X72755 


Hs 22978 ESTs ' ^ 








1144 








459029 AA131376 






456267 AI127958 


Hs.83393 cystatin c/M 




433285 AW975944 






449186 AW291876 






447861 AI434593 


Hs.164294 ESTs 




456023 R00028 


gb"ye70a06.s1 Soares fetal liver spleen 


11 36 




Hs 54578 ESTs 










430886 L36149 


Hs.248116 cnemokine (U motit) m# receptor 1 




450784 AW246803 


Hs.47289 ESTs 




452391 AL044829 


Hs.29331 carnitine palmitoyltransferase I, muscle 




449625 NM_014253 






456827 AA075687 






439328 W0741 1 






432093 H28383 


gb:yl52c03.r1 Soares breast 3NbHBst Homo 


1124 


407335 AA631047 


Hs.158761 Homo sapiens cDNA FU13054 fis, clone NT 




442501 AA315267 


Hs.23128 ESTs 


11 99 


429746 AJ237672 


Hs.214142 5,10-methylen8tetrahydrofolate reductase 




422858 R35393 


gb'yg64g10 r1 Soares infant brain 1 NIB H 


1120 


415156 X84908 


Hs.78060 phosphorylase kinase! beta 




446713 AV660122 




1120 


452221 C21322 


Hs.11577 ESTs 




418261 W78902 


Hs.293297 ESTs 


11 17 


433332 AI367347 


Hs.127809 ESTs 




434539 AW748078 


nS.<;t*Hlv to IS 




413471 BE142098 


gb*CM4-HT0137-220999-017-d11 HT0137 Homo 




410037 AB020725 


Hs.58009 KIAA0918 protein 




458332 AI000341 


Hs.220491 ESTs 




427654 AA410183 








Hs 173717 phosphatide acid phosphatase type 2B 


11 10 


431475 AI567669 






425710 AF030880 


Hs.159275 solute earner family, member 4 




413748 AW1 04057 


Hs. 19193 ESTs 




409208 Y00093 






457278 W92745 


Hs. 193324 ESTS 




407021 U52077 


gb:Human mannerl transposase gene, comp 




445701 AF055581 


Hs.131 31 lymphocyte adaptor protein 


11 "02 


408338 AW867079 


gb:MR1"SN0033-12O4O0-O02-c10 SN0O33 Homo 




401030 BE382701 Hs.25960 v-myc avian myelocytomatosis viral relat 10.95

437891 AW006969 Hs.6311 hypothetical protein FLJ20859 10.94

453874 AW591783 Hs.36131 collagen, type XIV, alpha 1 (undulin) 10.94

421562 AA530994 Hs.105803 ghrelin precursor 10.92

413431 AW246428 Hs.75355 ubiquitin-conjugating enzyme E2N (homolo 10.92

400132 10.92

436420 AA443966 Hs.31595 ESTs 10.90

424880 NM_000328 Hs.153614 retinitis pigmentosa GTPase regulator 10.88

433264 D85782 Hs.3229 cysteine dioxygenase, type I 10.88

429842 AI366213 Hs.173422 KIAA1605 protein 10.87

412405 AW948126 gb:RCXI-MT0013-28030(M)31-a12 MT0013 Homo 10.85

400615 10.80

425018 BE245277 Hs.154196 E4F transcription factor 1 10.80
456011 BE243628 gb:TCBAP1D1053 Pediatric pre-B cell acut 10.79

455982 BE176862 gb:RC4-HT0587-170300-012-a04 HT0587 Homo 10.74

450418 BE218418 Hs.201802 ESTs 10.73

412490 AW803564 Hs.288850 ESTs 10.72

436962 AW377314 Hs.5364 DKFZP564I052 protein 10.70

437743 AI383497 Hs.131811 ESTs, Weakly similar to ALULHUMAN ALU S 10.70

449967 R40978 Hs.271498 ESTs, Moderately similar to ALU1_HUMAN A 10.70

449590 AA694070 Hs.268835 ESTs 10.68

446035 NM_006558 Hs.13565 Sam68-like phosphotyrosine protein, T-ST 10.68

426530 U24578 Hs.170250 complement component 4A 10.66

428600 AW863261 Hs.15036 ESTs, Highly similar to AF161358 1 HSPC0 10.64

420090 AA220238 Hs.94986 ribonuclease P (38kD) 10.64

451593 AF151879 Hs.26706 CGI-121 protein 10.62

438893 AF075031 Hs.29327 ESTs 10.62

459324 AW080953 gb:xc28c12.x1 NCI_CGAP_Co18 Homo sapiens 10.61

439883 AL359652 Hs.171096 Homo sapiens EST from clone DKFZp434A041 10.58

406513 AA715328 Hs.291205 ESTs 10.57

407826 AA128423 Hs.40300 calpain 3, (p94) 10.57

419550 D50918 Hs.90998 KIAA0128 protein; septin 2 10.56

428522 R10184 Hs.191987 ESTs, Weakly similar to ALU1_HUMAN ALU S 10.56

459526 AI142350 Hs.146735 EST 10.55

411448 AA178955 Hs.271439 ESTs 10.54

410102 AW248508 Hs.279727 ESTs; 10.52

406577 10.52

408405 AK001332 Hs.44672 hypothetical protein FLJ10470 10.51

428966 AF059214 Hs.194687 cholesterol 25-hydroxylase 10.50

400880 10.48 

415875 AA894876 Hs.5687 protein phosphatase 1B (formerly 2C), ma 10.48 

434715 BE005346 Hs.116410 ESTs 10.46

406851 AA609784 Hs.180255 major histocompatibility complex, class 10.44

413409 AI638418 Hs.21745 ESTs 10.44

418489 U76421 Hs.85302 adenosine deaminase, RNA-specific, B1 (h 10.44

419465 AW500239 Hs.21187 Homo sapiens cDNA: FLJ23068 fis, clone L 10.44

419544 AI909154 gb:QV-BT200-010499-007 BT200 Homo sapien 10.44

432180 Y18418 Hs.272822 RuvB (E coli homolog)-like 1 10.44

413822 R08950 Hs.272044 ESTs, Weakly similar to ALU1_HUMAN ALU S 10.42

437446 AA788946 Hs.16869 ESTs, Moderately similar to CA1C_RAT COL 10.41

415701 NM_003878 Hs.78619 gamma-glutamyl hydrolase (conjugase, fol 10.41

443790 NM_003500 Hs.9795 acyl-Coenzyme A oxidase 2, branched chai 10.40

458873 AW150717 Hs.296176 STAT induced STAT inhibitor 3 10.38

415082 AA160000 Hs.137396 ESTs 10.37 

429124 AW505086 Hs.196914 minor histocompatibility antigen HA-1 10.36 

417187 AB011151 Hs.81505 KIAA0579 protein 10.34 

426827 AW067805 Hs.172665 methylenetetrahydrofolate dehydrogenase 10.34 

424280 NM_000030 Hs.271366 alanine-glyoxylate aminotransferase homo 10.33

446099 T93096 Hs.17126 ESTs 10.32 

423445 NM_014324 Hs.128749 alpha-methylacyl-CoA racemase 10.31

409995 AW960597 Hs.30164 ESTs 10.30 

432242 AW022715 Hs.162160 ESTs, Weakly similar to ALU4_HUMAN ALU S 10.30 

406394 AA172106 Hs.110950 Rag C protein 10.30

406189 10.29

422283 AW411307 Hs.114311 CDC45 (cell division cycle 45, S.cerevis 10.26

401598 AA172106 Hs.110950 Rag C protein 10.26

456995 T89832 Hs.170278 ESTs 10.26

416511 NM_006762 Hs.79356 Lysosomal-associated multispanning membr 10.24

427274 NM_005211 Hs.174142 colony stimulating factor 1 receptor, fo 10.24

401384 10.23

456226 D13168 Hs.82002 endothelin receptor type B 10.22

426928 AF037062 Hs.172914 retinol dehydrogenase 5 (11-cis and 9-cis 10.21

423032 AI684746 Hs.119274 ESTs 10.20

436556 AI364997 Hs.7572 ESTs 10.20

418400 BE243026 Hs.301989 KIAA0246 protein 10.19 

437401 AA757196 Hs.121190 ESTs 10.19 

403690 10.17 

423790 BE152393 gb:CM2-HT0323-171199-033-a08 HT0323 Homo 10.16

434094 AA305599 Hs.238205 hypothetical protein PRO2013 10.16

434967 AW975009 Hs.292274 ESTs 10.16

432827 Z68128 Hs.3109 Rho GTPase activating protein 4 10.16 

432660 AI288430 Hs.64004 ESTs 10.14 
452234 AW084176


Hs 223296 ESTs 




445629 AI245701 


gb:qk31f05.x1 NCI CGAP Kid3 Homo sapiens 


1013 


457236 AA626142 


Hs 179991 ESTs Weakly similar to KPCE HUMAN PROTE 




444605 Al 174603 


Hs.254105 enolase 1 (alpha) 




450313 AI038989 


Hs 24809 hypothetical protein FUJI 0826 




407482 NM 006056 






449971 AA807346 


Hs.288581 Homo sapiens cDNA FLJ14296 fis clone PL 
















417308 H 60720 


Hs.81892 KIAA0101 gone product 


1009 




Hs 1 79303 ESTs 




437252 AI433833 












434467 BE552368 






423698 AA329796 


Hs.1098 DKFZp434J1813 protein 


1002 


412707 AW206373 


Hs 16443 Homo sapiens cDNA* FU21721 fis clone C 




414658 X58528 


Hs.76781 ATP-binding cassette, sub-family D (AL.D) 




421832 NM 016098 


Hs.1 03725 HSPC040 protein 


1000 


423554 M9051 6 








Hs 172510 ESTs 




434673 AW1 37442 


Hs. 135965 ESTs 






Hs 180040 Homo sapiens cDNA" FLJ22439 fis clone H 


1000 


457803 BE501815 


Hs!l98011 ESTs 


9.99 


428279 AA425310 


Hs.1 55766 ESTs 




444412 AH 47652 


Hs.216381 Homo sapiens clone HH409 unknown mRNA 




417049 N72394 


Hs.44862 ESTs 




427509 M62505 






445424 AB028945 












447567 AW474513 






414709 AA704703 


Hs.77031 Sp2 transcription factor 








994 


427630 BE276115 


HS.144980 ESTS, Weakly Similar 10 CA13_HUMAN colla 




416111 AA033813 






423349 AF010258 
424308 AW975531 


Hs. 154443 minichromosome maintenance deficient (S. 




416814 AW1 92307 


Hs.80042 doliohyl-P-'Glc:Man9GlcNAc2-PP-dolichylgl 


9.90 


4179B6 AA481003 






425174 D87450 






438171 AW976507 


Hs.293515 ESTs 




421984 AW972187 






408597 NM_005291 






413907 AI09757O 






451296 AW801383 


ns.1 18573 n. sapiens mRNA lor ribosomal protein LIS 




433409 AI278802 


Hs.25661 ESTs 


rjj? 


450360 AW1 17416 


Hs 245484 ESTs 










449824 AI962552 


Hs.226765 ESTs 




431066 AF026273 


Hs 30504 Homo sapiens mRNA' cDNA DKFZp434E082 (fr 
Hs.249175 JnterIeukin-1 receptor-associated kinase 


982 


426457 AW894667 




980 


443371 AI792888 






437159 AL050072 


gb:nomo sapiens mRNA; cDN A DKFZp566E1 346 






Hs 155287 KIAA001 0 gene product 




447498 N67619 


Hs 43687 ESTs 




426759 A1590401 


Hs.21213 ESTs 




435129 AI381659 Hs.267086 ESTs 9.72

437672 AW748265 Hs.5741 flavohemoprotein b5+b5R 9.72

438209 AL120659 Hs.6111 KIAA0307 gene product 9.72

438440 AA807228 Hs.225161 ESTs 9.72

449720 AA311152 Hs.288708 ESTs; Weakly similar to KIAA0226 [H.sapi 9.72

414291 AI289619 Hs.13040 ESTs 9.72

436206 AK001451 Hs.265561 CD2-associated protein 9.70

446896 T15767 Hs.22452 Homo sapiens cDNA: FLJ21084 fis, clone C 9.70

412667 AW977540 Hs.269254 ESTs 9.70

423301 S67580 Hs.1645 cytochrome P450, subfamily IVA, polypept 9.67

440757 AW118645 Hs.160004 ESTs 9.67

441412 AI393657 Hs.159750 ESTs 9.66

421044 AF061871 Hs.101302 collagen, type XII, alpha 1 9.66
441530 AI248301
433533 D53304 
421470 R27496 
438613 C05569 
429324 AA488101 
450244 M007534 
407660 AW063190 
406554 

426404 M377607 
447045 AW392394 
449894 AK001578 
448376 AI494332 
407902 AL1 17474 
446572 AV659151 



414697 BE266134 

410846 AW807057 

421181 NMJM5574 

427308 D26067 

415995 NM_004573 



414342 M742181 



449375 R07114 
436357 AJ132085 
458723 AW137726 
457526 AW450584 
404741 

422409 NM_005428 
403708 

408806 AW847814 
417380 T06809 
422501 AA354690 
426197 AA004410 



Hs.280099 ESTs 

Hs.124981 ESTs 

Hs.181125 immunoglobulin lambda locus 

Hs.127112 ESTs 

Hs.65394 ESTs 

Hs.1378 annexin A3

Hs.243122 hypothetical protein FLJ13057 similar to

Hs.199245 inactivation escape 1 

Hs.125062 ESTs 

Hs.279101 ESTs 

Hs.273138 ESTs 

Hs.278569 KIAA0054 gene product
Hs.24129 hypothetical protein FLJ10716
Hs.196963 ESTs 

Hs.41181 Homo sapiens mRNA; cDNA DKFZp727C191 (fr 



manic fringe (Drosophila) homolog
chromosome 21 open reading frame 5
transferase of outer mitochondrial membr
gb:MR4-ST0062-031199-018-b03 ST0062 Homo
LIM domain only 2 (rhombotin-like 1)
KIAA0033 protein
phospholipase C, beta 2
ESTs

Homo sapiens cDNA: FLJ22199 fis, clone H

ubiquitin-conjugating enzyme E2A (RAD6 h 

ESTs 

ESTs 

ESTs 

gb:Homo sapiens mRNA for axonemal dynein 
ESTs, Moderately similar to laminin alph 
ESTs, Weakly similar to RIBB [H.sapiens]



Hs.31939 
Hs.129781 
Hs.76927 



Hs.174905 

Hs.994 

Hs.119768 

Hs.75912 

Hs.80612 

Hs.5588 

Hs.270745 

Hs^71224 



412110 i 

414158 AA361623 
408101 

414171 , 

415947 U04045 

426959 BE262745 

417519 AI689987 



Hs.116237 vav 1 oncogene

Hs.289005 Homo sapiens cDNA: FLJ21532 fis, clone C

gb:EST04698 Fetal brain, Stratagene (cat
Hs.144967 ESTs

Hs.167835 acyl-Coenzyme A oxidase 1, palmitoyl
Hs.30054 coagulation factor V (proaccelerin, labi

gb:RCO-NNO&1-<M0400-021-c10NN0021 Homo 
Hs.288775 Homo sapiens cDNA FLJ13900 fis, clone TH
Hs.123073 CDC2-related protein kinase 7
Hs.865 RAP1A, member of RAS oncogene family
Hs.78934 mutS (E. coli) homolog 2 (colon cancer,

gb:601153869F1 NIH_MGC_19 Homo sapiens c
Hs.177669 ESTs, Weakly similar to RMS1_HUMAN REGUL 
Hs.296422 FK506-binding protein 3 (25kD) 



446566 H95741 Hs.17914 Homo sapiens cDNA: FLJ22801 fis, clone K

gb:CM1-NN1031-060400-178-d05 NN1031 Homo 
Hs.152299 ESTs, Moderately similar to ALU5_HUMAN A 
Hs.127832 ESTs 
Hs.251871 CTP synthase 
429327 AA283981 Hs.199248 prostaglandin E receptor 4 (subtype EP4)
407061 X97748 gb:H.sapiens PTX3 gene promotor region.

416967 BE616731 Hs.80645 interferon regulatory factor 1 
423013 AW875443 Hs.22209 secreted modular calcium-binding protein 
439461 AA693960 Hs.103158 ESTs 

418830 BE513731 Hs.88959 Human DNA sequence from clone 967N21 on 
422763 AA033699 Hs.83938 ESTs, Moderately similar to MASP-2 [H.sa
442739 NM_007274 Hs.8679 cytosolic acyl coenzyme A thioester hydr
452859 AI300555 Hs.288158 Homo sapiens cDNA: FLJ23591 fis, clone L
403237 

415000 AW025529 Hs.239812 ESTs, Weakly similar to CALM_HUMAN CALMO 
417951 AW976410 Hs.289069 Homo sapiens cDNA: FLJ21016 fis, clone C
419066 Z98492 Hs.6975 PRO1073 protein 
448443 AW167128


Hs.231934 ESTs 


9.30 








409768 AW499S66 


gb:UI-HF-BROp-aji-h-03-0-Ul.f1 NIH_MGC_5 




453708 AI191811 


Hs.54629 ESTs 




442271 AF000652 


Hs.8180 syndecan binding protein (syntenin) 




410055 AJ250839 


Hs.58241 gene for serine/threonine protein kinase 




448692 AW013907 


Hs.224276 ESTs, Moderately similar to predicted us 




417381 AF164142 


Hs.82042 solute carrier family 23 (nucleobase tra 




422497 D29642 


Hs.1 528 KIAA0053 gene product 




414140 AA281279 


Hs.23317 ESTs 




435980 AF274571 


Hs.129142 ESTsj Weakly similar to DEOXYRIBONUCLEAS 




458530 BE395035 


Hs.1 99889 ESTs, Weakly similar to KIAA0874 protein 


9.24 


420819 AA280700 


gb:zs95h1 1.s1 NCLCGAP_GCB1 Homo sapiens 




444755 AA431791 


Hs.1 83001 ESTs 




411630 U42349 


Hs.71 1 1 9 Putative prostate cancer tumor suppresso 


qff) 


421246 AW5S2962 


Hs.300961 ESTs, Highly similar to AF151805 1 CGI-4 




421924 BE514514 


Hs,1 09606 ooronin, actin-binding protein, 1A 




414888 AL039185 


Hs.77558 thyroid hormone receptor interactor 7 




434267 AI206589 


Hs.1 16243 ESTs 




409213 U61412 


Hs.51133 PTK6 protein tyrosine kinase 6 




428242 H55709 


Hs.2250 leukemia inhibitory factor (cholinergic 




451736 AW080356 


Hs.293684 ESTs, Weakly similar to alternatively sp 


9.15 


413627 BE182082 


Hs.246973 ESTs 




416134 AA528402 


Hs.74861 activated RNA polymerase II transcriptio 




449251 AW151660 


Hs.31444 ESTs 


9.14 


452813 U54727 


Hs.1 91445 ESTs 




443622 AI911527 


Hs.1 1805 ESTs 


9.14 


413260 6E075281 


gb:PM1-BT0585-29020O-O05-d07 BT0585 Homo 




413450 Z99716 


Hs.75372 N-acetylgalactosaminidase, alpha- 


9.12 


446442 6E221533 


Hs.257858 ESTs 




438540 M810021 


Hs.1 36906 ESTs 


9.12 


426251 M24283 


Hs.168383 Intercellular adhesion molecule 1 (CD54) 




410290 AA402307 


Hs.73818 iibiquinol-cytochrorne c reductase hinge p 




437398 AA913736 


Hs.1 2671 5 ESTs 




421559 NM_014720 


Hs.1 05751 Ste20-related serine/threonine kinase 




439699 AF086534 


Hs.1 87561 ESTs, Moderately similar to ALU1_HUMAN A 




430799 C19035 


Hs.1 64259 ESTs 




424544 M8870Q 


Hs.150403 dopa decarboxylase (aromatic L-amino aci 




453942 AW1 90920 


Hs.1 9928 ESTs 




425844 T68073 


Hs.159628 serine (or cysteine) proteinase inhibito 




434658 AI624436 


Hs.1 94488 ESTs 




453999 BE328153 


Hs.240087 ESTs 




436490 R71543 


Hs.18713 ESTs 


9.05 


409192 AA065131 


Hs.233439 ESTs, Weakly similar to ALU7_HUMAN ALU S 




446223 BE300091 


Hs.1 1 9699 hypothetical protein FU12969 




447247 AW369351 


Hs.287955 Homo sapiens cDNA FUI1 3090 fis, clone NT 




450094 AI174947 


Hs.295789 Homo sapiens mRNA; cDNA DKFZp564D1164 (f 


on! 


432012 AW301344 


Hs.195969 ESTs 




422520 AU076730 


Hs.1 17977 kinesin2(60-70kD) 




418650 BE386750 


Hs.86978 prolyl endopeptfcfase 




423008 M81590 


Hs.123016 5-hydroxytryptamine (serotonin) receptor 




436476 AA326108 


Hs.53631 ESTs 




448206 BE622585 


Hs.3731 ESTs 


9.02 


431574 AW572659 


Hs.261373 adenosine A2b receptor pseudogene 




443453 R99876 


Hs.269882 ESTs 




435472 AW972330 




901 


420337 AW295840 


Hs!l4555 Homo sapiens cDNA: FU21513 fis, clone C 


9^00 


449810 AB008681 


Hs.23994 activin A receptor, type IIB 


9.00 


406780 M9023B6 


Hs.286 ribosomal protein L4 


8.99 


429169 AW341130 


Hs.197757 ESTs, Moderately similar to FGFE_HUMAN F 


8.99 


421326 AF051428 


Hs.103504 estrogen receptor 2 (ER beta) 


8.97 


425491 AA883316 


Hs.255221 ESTs 


8.96 


425516 BE000707 


Hs.29567 ESTs 


8.9S 


439773 AI051313 


Hs.143315 ESTs 


8.96 


443247 BE614387 


Hs.47378 ESTs 


8.96 


4S6623 AI0B4125 


Hs.108106 transcription factor 


8.95 


438707 L08239 


Hs.5326 porcupine 


8.95 


402240 




8.95
444152
409842 
416277 



AI125694 
AW501756
W78765 



411486 N 



I AL1 20344 
D AI287817 
3 AA002064 



442732 AA257161 
419741 NM_007019 
411499 AW849292 
431154 AW971228 
414922 D00723 
418036 Z37976 
406422 

422926 NM_016102 



Hs.149305 Homo sapiens cDNA FLJ14264 fis, clone PL

gb:UI-HF-BR0p-ajm-c-09-0-UI.r1 NIH_MGC_5

Hs.73580 ESTs

Hs.111334 ferritin, light polypeptide

Hs.66170 HSKM-B protein

Hs.75074 mitogen-activated protein kinase-activat

Hs.129636 ESTs

Hs.18920 ESTs

Hs.181165 eukaryotic translation elongation factor

Hs.20999 Homo sapiens cDNA: FLJ23142 fis, clone L

Hs.8658 hypothetical protein DKFZp434EQ321 

Hs.93002 ubiquitin carrier protein E2-C 

gb:IL3-CT0215-020300-090-E06 CT0215 Homo 



Hs.86575 
Hs.271711 
Hs.274415 



450000 AI952797 



419972 AL041465 Hs.294038 
Hs.299112 
Hs.301834 
Hs.104930
Hs.137565
Hs.250614 
Hs.21766 



428807 AA435997 
415839 R40611 
419553 N34145 
420309 AW043637 
421863 AI952677 Hs.10897£ 
447965 AW292577 Hs.94445 
459172 BE063380 



ring finger protein 16 
HGF activator 
CDC28 protein kinase 2 
mitogen-activated protein kinase kinase 
ESTs 

Homo sapiens cDNA FLJ10229 fis, clone HE
Homo sapiens cDNA: FLJ21559 fis, clone C
ESTs

hexokinase 3 (white cell)

ESTs, Moderately similar to ALU2_HUMAN A

Homo sapiens cDNA FLJ11441 fis, clone HE

Homo sapiens cDNA FLJ10952 fis, clone PL

ESTs 

ESTs 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA DKFZp434P228 (fr 
ESTs 

gb:PM0-BT0275-291099-002-g10 BT0275 Homo



411534 AW850473 

456161 BE264645 

413554 AA331881
401744 

425348 AL137477 



429455 AI472111 



gb:IL3-CT0219-280100-061-B11 CT0219 Homo
Hs.282093 Homo sapiens cDNA: FLJ21918 fis, clone H
Hs.75454 peroxiredoxin 3 



441746 H59955 

411945 AL033527 

413492 D87470 

435706 W31254 

433741 AA609019 

426340 Z97989 

422779 AA317036 

449785 AI225235 

420144 AA811813 



Hs.155912 
Hs.127950 
Hs.297722 
Hs.44229 
Hs.134015
Hs.153716 
Hs.179765 
Hs.216640 
Hs.22703
Hs.292507 
Hs.138902 
Hs.127829 
Hs.92137 
Hs.75400 
Hs.7045 



Hs.41989 



432606 NM_002104

425762 BE244076 

427448 BE246449 

418033 W68180 

429084 AJ001443 

417094 NM_006895

457277 NM_004736



Hs.119421

Hs.31178 

Hs.3066 

Hs.159578 

Hs.2157 



cadherin-like 24 

bromodomain-containing 1 

Human DNA sequence from clone RP1-85F18 

dual specificity phosphatase 12 

uronyl 2-sulfotransferase 

Homo sapiens mRNA for Hmob33 protein, 3' 

Homo sapiens mRNA; cDNA DKFZp586H1921 (f 

ESTs 

Homo sapiens mRNA; cDNA DKFZp434P1018 (f 
ESTs 

ESTs, Weakly similar to B34087 hypotheti 
ESTs 

v-myc avian myelocytomatosis viral oncog
KIAA0280 protein 
GL004 protein 
ESTs 

FYN oncogene related to SRC, FGR, YES 
ESTs 

Homo sapiens cDNA: FLJ23231 fis, clone C

ESTs 

ESTs 

granzyme K (serine protease, granzyme 3; 
Homo sapiens mRNA for FLJ00020 protein,
Wiskott-Aldrich syndrome (eczema-thrombo
Homo sapiens cDNA FLJ12507 fis, clone NT
splicing factor 3b, subunit 3, 130kD
histamine N-methyltransferase
xenotropic and polytropic retrovirus rec
422631 BE218919 Hs.118793 hypothetical protein FLJ10688
410679 AW795196 Hs.215867 ring finger protein 14
431585 BE242803 Hs.262823 hypothetical protein FLJ10326



430259 BE550182 



451491 
452561 
420027 



AF009746 

X54136 

U91939 

AI479773 
BE328882 
AA091228 



a disintegrin and metalloproteinase doma
PIBF1 gene product
ClpX (caseinolytic protease X, E. coli)
RalGEF-like protein 3, mouse homolog
ESTs, Weakly similar to ALU7_HUMAN ALU S
gb:RC3-BT0502-130100-012-g07 BT0502 Homo 



Homo sapiens cDNA FLJ13741 fis, clone PL
KIAA1634 protein 

ATP-binding cassette, sub-family D (ALD) 
immunoglobulin lambda locus 
G protein-coupled receptor 25



Hs.181679 ESTs 

Hs.193096 ESTs, Moderately similar to U119_HUMAN U 

gb:cchn2152.seq.F Human fetal heart, Lam 
Hs.85201 C-type (calcium dependent, carbohydrate- 
Hs.196352 neutrophil cytosolic factor 4 (40kD) 
Hs.195641 ESTs
Hs.199371 ESTs 

gb:yq94h03.r1 Soares fetal liver spleen 
Hs.13503 i " 



Hs.49169 
Hs.94395 
Hs.181125 



411079 
418452 
429109 
448019 
449865 
431180 
445988 
405876

407235 D20569 Hs.169407 SAC2 (suppressor of actin mutations 2, y 

414807 AI738616 Hs.77348 hydroxyprostaglandin dehydrogenase 15-(N 

425671 AF193612 Hs.159142 lunatic fringe (Drosophila) homolog 

452413 AW082633 Hs.212715 ESTs 

421620 AA446183 Hs.91885 ESTs 

444539 AI955765 Hs.146907 ESTs 

Hs.77929 excision repair cross-complementing rode 



415102 M31899 
405552 
418068 AW971155 
420133 AA426117 
438887 R68857 
446468 AI765890 
446585 AV659397 
441896 AW891873 
437718 AI927288 
420656 AA279098 
429303 AW137635 
450624 AL043983 
452573 AI907957 



Hs.293902 ESTs, Weakly similar to prolyl 4-hydroxy 
Hs.14373 ESTs 
Hs.265499 
Hs.16341 
Hs.282948 



Hs.196779 
Hs.187636 
Hs.44238 
Hs.125063 



ESTs; Moderately similar to !!!! ALU SUB
ESTs

gb:CM3-NT0090-040500-173-b02 NT0090 Homo



Hs.122647 
Hs.75613 
Hs.156827 



Homo sapiens cDNA FLJ13825 fis, clone TH
Homo sapiens cDNA FLJ14082 fis, clone HE
N-myristoyltransferase 2 
CD36 antigen (collagen type I receptor, 
ESTs, Weakly similar to ALU1_HUMAN ALU S 
regulator of G-protein signalling 17 



421828 AW891965 

417022 NM_014737 

421029 AW057782 

425171 AW732240 



412643 AW971239 
424775 AB014540 
446848 AW136083 
448043 AI458653 
407183 AA358015 
412324 AW978439 
419594 AA013051 



dimethyla 



Irolase 



431689 AA305688 



Hs.80905 Ras association (RalGDS/AF-6) domain fam 
Hs.293053 ESTs 
Hs.300615 ESTs 

gb:wj71c12.x1 NCI_CGAP_Lu19 Homo sapiens 

Hs.293982 ESTs 

Hs.153026 SWAP-70 protein

Hs.195266 ESTs, Weakly similar to S59501 interfero 

Hs.201881 ESTs 

gb:EST66864 Fetal lung III Homo sapiens 
Hs.69504 ESTs 

Hs.91417 topoisomerase (DNA) II binding protein 

gb:EST384925 MAGE resequences, MAGL Homo 
Hs.267695 UDP-Gal:betaGlcNAcbeta 1,3-galactosyltr 
Hs.283365 ESTs, Weakly similar to ALU5_HUMAN ALU S
447685 AL122043

459119 AW844498 
400817 

425265 BE245297 

409385 AA071267 

439121 BE047779 

419963 X04430 

408327 AW182309



448064 AA379036 
442914 AW188551 
428032 AW997704 
434194 AF119847
458677 AW937670
420925 NM_015698 
416475 T70298 
416852 AF283776 
430676 AF084866 
428455 AI732694 
435343 AW194962 
450783 BE266695
404946 

422942 AF054839 
453716 AA037675 
437098 AA744488 
443907 AU076484 
401930 AF106069 
446554 AA151730 
426290 AB007918 
419904 AA974411 



424738 AI963740 



424534 D87682 
424429 U63830 
442604 BE263710 
442992 AI914699 
427210 



423730 AA330214



416051 AA835868 
417231 R40739 
422049 W25760 
427528 AU077143 
458776 AV654978 
417687 AI828596 
423218 NM_015896 
425397 J04088 
406964 M21305 
402401 U42349 
423397 NM_001838
427857 AL133017 
401519 

447188 H65423 
424704 AI263293 
435854 AJ278120 
448556 AW885606 
449217 AA278536 
453124 AI139058 
442812 AI018406 
421129 BE439899 



gb:TCBAP1E2482 Pediatric pre-B cell acut
gb:zm61g01.r1 Stratagene fibroblast (937 
Hs.44701 ESTs 

Hs.93913 interleukin 6 (interferon, beta 2) 
Hs.249963 ESTs, Highly similar to dJ1170K4.4 [H.sa

gb:EST91809 Synovial sarcoma Homo sapien 
Hs.99519 Homo sapiens cDNA FLJ14007 fis, clone Y7
Hs.11493 Homo sapiens cDNA FLJ13536 fis, clone PL
Hs.283940 Homo sapiens PRO1550 mRNA, partial cds
Hs.254379 ESTs
Hs.100391 T54 protein

gb:yd26g02.s1 Soares fetal liver spleen 
Hs.80285 Homo sapiens mRNA; cDNA DKFZp586C1723 (f 

gb:Homo sapiens envelope protein RIC-3 ( 
Hs.98520 ESTs 
Hs.199028 ESTs

gb:601190242F1 NIH_MGC_7 Homo sapiens cD 



Hs.122540
Hs.152675
Hs.132842
Hs.9963 
Hs.23163 
Hs.301789 
Hs.169182 
Hs.18672



Hs.46826 
Hs.79881 
Hs.150275 
Hs.146847 
Hs.279904 
Hs.13297 
Hs.173987



tetraspan2 
ESTs 

ESTs, Moderately similar to ALU1_HUMAN A 

TYRO protein tyrosine kinase binding pro 

ubiquitin specific protease 15 

ESTs, Weakly similar to similar to C.ele 

KIAA0449 protein 

ESTs 

ESTs, Weakly similar to TRHY_HUMAN TRICH
ESTs 

Homo sapiens cDNA: FLJ23006 fis, clone L
KIAA0241 protein 

TRAF family member-associated NFKB activ 

ESTs 

ESTs 



Hs.19121 

Hs.25253 

Hs.21326 

Hs.77631 

Hs.179565 

Hs.19904 

Hs.250691 

Hs.167380 

Hs.156346 

Hs.247946 

Hs.71119 

Hs.1652

Hs.2210 



gb:EST33935 Embryo, 12 week II Homo sapi 
adaptor-related protein complex 2, alpha 
Homo sapiens cDNA: FLJ20935 fis, clone A 
ESTs 

glycine cleavage system protein H (amino 
minichromosome maintenance deficient (S.
cystathionase (cystathionine gamma-lyase 
ESTs 

BLu protein 

topoisomerase (DNA) II alpha (170kD) 
Human alpha satellite and satellite 3 j j 
Putative prostate cancer tumor suppresso
chemokine (C-C motif) receptor 7
thyroid hormone receptor interactor 3 



Hs.17631 Homo sapiens cDNA FLJ20118 fis, clone CO

Hs.152096 cytochrome P450, subfamily II J (arachido

Hs.4996 DKFZP564D166 protein 

Hs.5064 ESTs 

Hs.23262 ribonuclease, RNase A family, k6 

Hs.23296 ESTs 

Hs.131284 ESTs 

Hs.89271 ESTs
TABLE 9A shows the accession numbers for those primekeys lacking a UnigeneID in Table
9. For each probeset we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.



Pkey: Unique Eos probeset identifier number 

CAT number: Gene cluster number

Accession: Genbank accession numbers 
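
The relationship between the three columns defined above can be sketched in Python as a lookup keyed by Pkey. This is only an illustration, not part of the original analysis: the function name `parse_cluster_rows` and the whitespace-delimited row layout are assumptions, and the two sample rows are taken from this table with the OCR-damaged CAT suffix read as `_1`.

```python
from typing import Dict, List, NamedTuple


class Cluster(NamedTuple):
    """One Table 9A row: a gene cluster and its member accessions."""
    cat_number: str
    accessions: List[str]


def parse_cluster_rows(rows: List[str]) -> Dict[int, Cluster]:
    """Parse 'Pkey CAT_number accession...' rows (hypothetical layout)."""
    table: Dict[int, Cluster] = {}
    for row in rows:
        fields = row.split()
        if len(fields) < 3:
            # Skip rows that lack a CAT number or any accession.
            continue
        pkey, cat_number, *accessions = fields
        table[int(pkey)] = Cluster(cat_number, accessions)
    return table


# Sample rows transcribed from the table (CAT suffix assumed to be "_1").
rows = [
    "412110 1277844_1 AW893569 AW893571 AW893588 AW893593",
    "416475 1596398_1 T70298 H58072 R02750",
]
clusters = parse_cluster_rows(rows)
print(clusters[412110].accessions)
```

A dictionary keyed by Pkey is used here because Table 9 refers to probesets by Pkey, so a lookup in that direction is the one a reader is most likely to need.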



Pkey CAT number Accession

408057 1035720_-1
408069 103655_1
408182 104479_1



409126 110159_1

409292 111586_1

409314 111841_1

409385 112523_1

409398 1123716_1

409671 114731_1

409768 1154035_1

409841 1158088_1

409842 1156119_1
409853 1156226_1
410531 1207200_1
410688 1218101_1
410846 1223902_1



H81795 Z42291 R20973 AA046920 

AA047854 M057506 AA053841 

AW867079 AW867086 AW182772 

BE540279 AW410659 AA057857 R77693 BE278674 

AA063426 AW962323 AW408063 AA063503 M772927 AW753492 BE175371 AA311147
AA071051 AA070584 AA069938 AA102136 AA074430 
AA070266 M084967 AA126998 
AA071267 T65940 T64515 AA071334 

AW386461 AW876408 AW386672 AW388599 AW876258 AW386619 AW386289 AW876136 AW876203 AW876213 AW876301 
AW876295 AW876349 AW876365 AW876160 AW876369 AW876352 AW876271 
AA076769 AA076781 AI087968 

AW499566 AW502378 AW499522 AW502046 AW502671 AW501917 AW501858 AW501721 AW502813 
AW502139 AW502432 AW502235 AW501683 AW502647 
AW501756 AW502096 AW502465 AW501715 



AW752953 H88044 BE156092
AW796342 AW796356 BE161430 

AW807057 AW807054 AW807189 AW807193 AW807369 AW807429 AW807364 AW807365 AW807078 AW807256 AW807180 



410896 1226053_1 AW809637 AW809697 AW810554 AW809707 AW809885 AW810000 AW810038 AW809742 AW809816 AW809749 AW809639
AW809722 AW809836 AW809774 AW810023 AW810013 AW809813 AW809650 AW809728 AW809768 AW809951 AW809657
AW809954

411079 123128_1 AA091228 H7186D H71073



AW849292 AW849431 AW849422 AW849428 AW849420 AW849424 AW849427 
AW350140 AW850195 AW850192
AW850473 AW850471 AW850431 Al 



411507 
411534 
411972 

412110 1277844_1 AW893569 AW893571 AW893588 AW893593
412226 
412257 
412405 



9_1 W26786 AW998612 AW902272 



1356003_1
1371778_1
1385114_1
142409_1
1511339_1



BE142098 BE142092 

BE159999 BE160056 BE160107 BE160139

AA136301 AI381776 AA136321

T81668 C19040 C17569

F06495 R24336 R13046

H14487 R50911 Z43216

T70298 H58072 R02750

T06809 N75735

W28573

AW749617 R64714 AA244138 AA244137 BE094019
AI909154 AA526337 AA244193 AI909153 
AA280700 AW975494 AA687385 



413471 
413729 
414182 
414989 
415354 
416011 1566439_1
416475 1596398_1
417380 1672461_1
419392 1843934_-1
419541 185724_1



422673 219674_1 N59027 AA314694 N53937 R08100
AA315158 AW961298 N76067 AW802759 AI858495 W04474




222209_1


R35398 BE252178 AA318153 
BE077458 AA337277 AA319285 






AA330214 AW962519 T54709 






BE152393 AA330984 BE073904 






AA339666 AW952809 AA349119




241409_1


AA343933 AA344060 AW963081 


J(Pqrq 




BE245297 AA353976 AW505023 




273830_-1


BE262745 

AF084866 AF084870 AF084864 AF084867 AF084869 AF084865 AF084868 AW818206 AW812038 BE144813 BE144812 






AW812041 AW812040 AW812067 BE061583 BE061604 T05808 AI352469 AA580921 BE141783 BE141782 BE061601 




326269_1


AW814393 AW885029 

AW972830 AA527647 M489820 AA570362 


431180 


328906_1


H55883 AW971249 AA493900 H55788 


432093 


341283_1


H28383 AW972670 H28359 AA525808 


434596 


38937_1


T59538 T59E89 T59598 T59542 AF147374 


436357 


41842_1


AJ132085 Z83805 






AL050072 AW900148 


437495 


43765_1


BE177778 BE177779 AL390180 AA359908 






H66948 AF085954 H66949 






H56389 AF085977 H56173 






BE410734 BE560117 BE270054 BE296330 BE267957 AI003007 BE545259






AW891873 AW891897 BE564764 






AI245701 BE272724 






BE617135 AW504051 AW504283 






AA379036 AA150589 AI696854 BE621316 
BE266695 BE265474 N53200 BE267333 






AA215672 AI696628 AA013335 H86334 AA017006 


452560 


922216_1


AI907039 AI907081 

BE077084 AW139963 AW863127 AW806209 AW806204 AW806205 AW806206 AW806211 AW806212 AW806207 AW806208






AW806210 AI907497




928309_1


AW838616 AW838660 BE144343 AI914520 AW888910 BE184854 BE184784 




980026_1


U83527 AL120938 U83522






AW860158 AW862385 AW860159 AW862386 AW862341 AW821869 AW821893 AW062660 AW062656 




1224342_1


AW807530 AW807540 AW807537 AW846086 BE141634 AW846089 AW807499 AW807533 AW838499 


454791 


1234759_1


BE071874 BE071882 AW820782 AW821007 


454977 


1247099_1


AW848032 AW848630 AW848478 AW848623 AW848484 AW848169 AW848830 AW848149 AW848119 AW848893 AW848903






AW848407 


455131 


1254674_1


AW857913 AW857916 AW857914 AW861627 AW861626 AW861624 


455183 


1259023_1 


AW984111 AW863918 AW863856 




1266449_1


AW877015 AW877133 AW876978 AW877071 AW376988 AW877069 AW877063 AW877013




1285173 16 


AW903533 AW903516 AW903562 BE085202 BE035215 BE085214 BE085209 BE085172 BE035175 BE085193 BE085211


455932 


1396849_1


BE085199 

BE176862 BE176876 BE176947 BE176878 


456011 


1410860_1


BE243628 BE246081 BE247016 BE241984 BE241534 BE246091 BE245679 BE243620 BE245998 BE242329 BE241417 






BE241457 BE242522 BE241989 BE241464 


456023 


1416335_1


R00028 BE247630


457586 


360505_1


AW062439 AW751554 AA579463 


457595 
457751 


364225_-1
399422_1


AA584854 
AI908236 AA663731


459070 


883688_1


AI814302 AI814428
W07808 AI822066


459081 
459145 


889426_1
918957_1


AI903354 AI903489 AI9034a8


459172 


921149_1


BE063380 BE063346 AI906097


459234 


945240_-1


AI940425
TABLE 9B shows the genomic positioning for those primekeys lacking Unigene IDs and
accession numbers in Table 9. For each predicted exon, we have listed the genomic sequence
source used for prediction. Nucleotide locations of each predicted exon are also listed.

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the
publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al. Nature (1999) 402:489-495.

Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.
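
To make the nucleotide-position notation concrete, the following Python sketch splits one position string from this table into exon intervals and sums their lengths. It is a sketch under stated assumptions: the helper names `parse_nt_position` and `total_exon_length` are hypothetical, and treating both coordinates as inclusive is an assumption that the legend does not confirm.

```python
from typing import List, Tuple


def parse_nt_position(spec: str) -> List[Tuple[int, int]]:
    """Split 'start-end,start-end,...' into (start, end) integer pairs."""
    exons: List[Tuple[int, int]] = []
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue
        start, end = part.split("-")
        exons.append((int(start), int(end)))
    return exons


def total_exon_length(exons: List[Tuple[int, int]]) -> int:
    """Sum exon lengths, assuming inclusive coordinates (an assumption)."""
    # abs() tolerates entries whose start and end are listed in reverse order.
    return sum(abs(end - start) + 1 for start, end in exons)


# Position string listed for Pkey 401201 (Minus strand) in this table.
exons = parse_nt_position(
    "138534-138629,139234-139294,140121-140335,142033-142479"
)
print(len(exons), total_exon_length(exons))
```

Entries that are truncated in the source (for example, a trailing `149801-`) would raise a `ValueError` here; that is deliberate, since a missing coordinate should not be guessed.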



Pkey Ref Strand Nt_position


400452 8113550 


Minus 


90308-90505 


400557 9801261 




208453-208528,209633-209813 


400615 9908994 


Plus 


118036-118166,118681-118807 


400802 8567867 


Minus 


174571-174856 


400817 8569994 


Plus 


170793-170948 


400880 9931121 


Plus 


29235-29336,36363-36580 


400885 9958187 


Minus 


58242-58733 


400926 7651921 
400952 7658481 


Plus 


52033-52158,53956-54120,54957-55052,55420-55480,56452-56666,5725
192667-192826,194387-194876 


400991 8096825 


Plus 


159197-159320 


401044 8117619 


Plus 


73501-73674 


401124 8570296 




124181-124391 


401163 6981820 


Plus 


5302-5545 


401201 9743387 


Minus 


138534-138629,139234-139294,140121-140335,142033-142479 


401286 9801342 


Minus 


147036-147318 


401384 6850939 




58360-58545 


401468 6433826 


Plus 


13056-13482 


401515 7630851 




29929-30126 


401519 6649315 


Plus 


157315-157950 


401672 9838136 


Plus 


128526-128704,130755-130860 


401744 2576349 


Plus


14595-14751 


401851 7770425 


Minus 


146443-146664,147794-147971,148351-148480,148980-149111,149801-


401866 8018106 


Plus


73126-73623 


402240 7690131 


Plus 


104382-104527,106136-106372 


402359 9211204 


Minus 


40403-41961 


402535 9908890 


Minus 


174893-175050,183210-183435 


402788 9796102 


Plus 


98273-101430 
53242-53432 


402802 3287156 
402812 6010110 


Plus 


25026-25091,25844-25920 


402828 8918414 


Plus


69071-69642 


402835 9187337 


Plus 


26961-27101 


402838 9369121 


Minus 


32589-32735,35478-35666 


402842 9369121 


Plus 


76355-76479 


402895 9967547 
402964 9581599 


Minus 


85537-85671,86379-86469 
46624-46784 


403137 9211494 


Minus 


92349-92572,92958-93084,93579-93712,93949-94072,94591-94748,952 


403237 7637807 


Plus 


7271-7527 


403259 7770585 


Plus 


4693-4857


403683 7331517


Plus 


217175-217446 


403690 7387384 


Minus 


78627-79583 


403708 5705981 


Minus 


134394-134812 


403838 4176355 


Plus 


19197-19502 


403851 7708872 


Plus 
Plus 


22733-23007 
24755-24969 


403976 7657840 
404407 7329316 


Minus 


48154-48499 


404426 7407959 


Plus 


77842-77954 


404632 9796668 


Plus 


45096-45229 


404741 8574139 


Plus 


143025-143467 


404756 7706327 


Plus 


82849-83627 


404946 7382189 


Plus 


134445-134750 


405074 7770440 


Plus 


44340-44559,44790-45059 


405125 8247873 


Plus 


137113-137814 


405172 S966752 


Plus 


153027-153262
405236 7249076 Minus

405325 6094661 Minus 

405411 3451356 Minus 

405495 8050952 Minus 

405552 1552506 Plus 

405601 5815493 Minus

405685 4508129 Minus 

405777 7263187 Minus 

405856 7653009 Plus 

405876 6758747 Plus 

405932 7767812 Minus 

405934 6758795 Plus 

406006 8247801 Minus

406134 9163473 Plus 

406189 7289992 Minus 

406422 9256411 Plus 

406516 7711422 Minus 

406538 7711478 Plus 

406554 7711566 Plus 

406577 7711730 Plus 



151699-151915 
25818-26380 

17503-17778,18021-18290 

72182-72373 

45199-45547 

147835-147935,149220-149299

37956-38097 

104773-105051 

101777-102043 

39694-40031 

123525-123713 

159913-160605 

42640-42776

153291-153452 

22007-22234 

163003-163311 

128375-128449,128560-128784 

35196-35367,38229-38476,40080-40216,43522-43340 

106956-107121 

11377-11509
TABLE 10: shows genes, including expressed sequence tags, differentially expressed in
taxol-resistant prostate tumor xenografts as compared to taxol-sensitive prostate tumor
xenografts. The genes are indicated as either being upregulated or downregulated during the
induction of taxol resistance in sequential passages of the grafts.



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneID: Unigene number

Unigene Title: Unigene gene title 

Eos: Internal Eos name 

F00-F14: passage number
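
As a rough illustration of how a row of passage values maps to an "up" or "down" call, the Python sketch below compares the mean of the earliest passages with the mean of the latest ones. The source does not state how the calls were made, so the function `classify_response`, the three-passage window, and the two-fold threshold are all assumptions chosen for illustration; only the two rows of values are taken from the table.

```python
from statistics import mean
from typing import Sequence


def classify_response(values: Sequence[float],
                      n_edge: int = 3,
                      fold: float = 2.0) -> str:
    """Label a passage series 'up', 'down', or 'unchanged'.

    Compares the mean of the first n_edge passages with the mean of the
    last n_edge passages. Window size and fold threshold are assumptions.
    """
    early = mean(values[:n_edge])
    late = mean(values[-n_edge:])
    if early == late:
        return "unchanged"
    if late >= fold * early:
        return "up"
    if early >= fold * late:
        return "down"
    return "unchanged"


# Passage values transcribed from the table for two probesets.
liprin_a2 = [1, 9, 8, 9, 32, 20, 34, 122, 105, 82, 71, 111]            # Pkey 117921
ests_cha1 = [290, 281, 267, 335, 270, 284, 150, 157, 83, 39, 49, 75]   # Pkey 112971

print(classify_response(liprin_a2), classify_response(ests_cha1))
```

Averaging a few passages at each end, rather than comparing single values, is meant to damp the passage-to-passage noise visible in several rows.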



Pkey    ExAccn         UnigeneID  UnigeneTitle  Eos   Resp.    F00   F00   F02   F02   F05   F05   F07   F09   F10   F11   F13   F14
117921  N51002         Hs.47170   Liprin A2     PM28  UP         1     9     8     9    32    20    34   122   105    82    71   111
112971  T17185         Hs.4299    ESTs          CHA1  down     290   281   267   335   270   284   150   157    83    39    49    75
126645  AI167942       Hs.61635   STEAP         PAA5  down     106   111   103    71    34    67          14     2     1           1
119018  N95798         Hs.179809  ESTs          PAB2  down     765   841   757   909   742   704   478   428   253   175   228   238
110844  N31952         Hs.167531  ESTs          PAV7  down     175   192   147   141   123   129    73    65    55    48    54    84
100654  HG2841-HT2969  Hs.75442   Albumin, A    PM01  down     666   605   504   728   357   445   602   187   117   127   117   113
100655  HG2841-HT2970  Hs.75442   Albumin, A    PM02  down     620         486   688   368   386   606   175   101    95   115    97
102076  U09579         Hs.252437  cyclin-dep    PM03  down     101    94   143   190   105   107    88    40    34    31    46    22
102208  U22961         Hs.75442   albumin       PM04  down     495   424   323   518   252   296   467   188   169   143   165   145
103739  AA075779                  mitochondr    PM05  down      75   190   606   230   378   106   218          69   192    69    99
107036  AA599690       Hs.15725   SBBI48        PM06  down      87   124   115   188   132   111    66    71    49    70          50
108242  AA062746                  ESTs          PM07  down      14    20   252    13    22    43   193    10    10   104    21    18
108282  AA065143                  solute car    PM08  down      27    54   178    73   103    37          24          53    15    34
108679  AA115963                  beta-1-glo    PM09  down     680   893  1292   656   869   389          74   118   662   359   409
108731  AA126313       Hs.107476  ATP syntha    PM10  down      10    19   185    25    60     1    32     3     7           1     1
110675  H89355         Hs.6598    adrenergic    PM11  down     207   334   237   239   231   220   119   145    93    64    56   124
115412  AA283804       Hs.193552  ESTs          PM12  down     146   316   282   271   340   334   115   238   100   196    83   207
115844  AA430124       Hs.234607  MDM2          PM13  down      49    93    94   154   132    91    23    54    23    76    14    41
120588  AA281591       Hs.16193   ESTs          PM14  down      80   157    58   141   159   127    39    83    35    37    16    46
132349  Y00705         Hs.181286  serine pro    PM15  down     146   217   214   150   106   128   177    85    54          66    56
132888  AA490775       Hs.5920    N-acetylma    PM16  down      92   150   132   178   125   139    53    94    48    67    41    80
132967  AA032221       Hs.61635   STEAP         PM17  down     224   208   203   215   205   180   132    65    68    50    48
133063  AA283085       Hs.64065   ESTs          PM18  down      85   148   161   150    92   108    42    99    42    65    29   126
134374  D62633         Hs.8238    ESTs          PM19  down     230   240   194   212   231   189    89   123   107    95    68    91
135400  M23263         Hs.99915   androgen r    PM20  down      36   167    99   178   132   101    23    71    26   122    14    44
TABLE 11: shows genes, including expressed sequence tags, that are up-regulated in
prostate tumor tissue compared to normal prostate tissue as analyzed using the Affymetrix/Eos
Hu01 GeneChip array. Shown are the ratios of "average" normal prostate to "average"
prostate cancer tissues.



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneID: Unigene number

Unigene Title: Unigene gene title 

R1: Background subtracted normal prostate : prostate tumor tissue 
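
The R1 column can be read as a background-subtracted normal-to-tumor ratio, so values well below 1 indicate higher expression in tumor. The Python sketch below shows one way such a ratio could be computed. The function name, the intensity values, and the `floor` parameter (which keeps the ratio defined when a subtracted signal is at or below zero) are hypothetical and are not taken from the source.

```python
def background_subtracted_ratio(normal: float,
                                tumor: float,
                                background: float = 0.0,
                                floor: float = 1.0) -> float:
    """Return (normal - background) / (tumor - background).

    Each subtracted intensity is clamped to `floor` so the ratio stays
    defined; the clamp is an assumption, not the source's stated method.
    """
    normal_signal = max(normal - background, floor)
    tumor_signal = max(tumor - background, floor)
    return normal_signal / tumor_signal


# Hypothetical average intensities: a gene barely detected in normal
# prostate but strongly expressed in tumor gives a ratio far below 1.
ratio = background_subtracted_ratio(normal=22.0, tumor=1010.0, background=10.0)
print(round(ratio, 3))
```

Under this reading, the smallest ratios in the table correspond to the genes most strongly up-regulated in tumor relative to normal tissue.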

























Hs.195188 








loqpqp 


X13810 


H S 1101 




HG2566-HT4792 




133448 


M34516 


Hs.170116 


125193 


W67577 


Hs.84298 










AA459310 






















134365 


R32377 


Hs.82240 
Hs.189885 


132335 


D60387 


Hs 32706 












Hs.250879


133769 


M17733 


Hs.75968 


107904 


AA026648 


Hs.61389








105987 


AA406631 


Hs.110299 


131466 


F03233 


Hs.27169 


102859 


X00274 


Hs.76807 


134626 


S82198 


Hs.8709 


134170 


M63138 


Hs.79572 


131713 


X57809 


Hs.181125 


100748 


HG3517-HT3711 




118769 


N74496 




111734 


R25375 




109221 


AA192755 


Hsi5840 6 
Hs.76719 


133846 
135281 


AA480073 
AA401575 


Hs.97757 


119073 


R32894 


Hs.45514 


100760 


HG3576-HT3779 




101426 


M19483 


Hs.25 


129568 


AA428025 


Hs.114360


130900 


Z38468 


Hs.21036 


133879 


M13829 


Hs.77183 


100627 


HG2702-HT2798 




129424 


M55593 


Hs.111301


128652 


AA621245 


Hs.103147 


129979 


T72635 


Hs.13956 


133468 


X03068 


Hs.73931 


102636 


U67092 




129536 


M33493 


Hs.184504 


133599 


M64788 


Hs.75151 



Unigene Title 

FBJ murine osteosarcoma viral oncogene homolog B

Immunoglobulin kappa variable 1D-8 

glyceraldehyde-3-phosphate dehydrogenase 

immediate early protein 

POU domain; class 2; transcription factor 2 

Microtubule-Associated Protein Tau, Alt. Splice 3, Exon 8

immunoglobulin lambda-like polypeptides 

CD74 antigen (invariant polypeptide of major histocompatibility 



ubiquitin C 

Homo sapiens mRNA; cDNA DKFZp586L1722 (from clone 
DKFZp586L1722) 

major histocompatibility complex; class II; DM beta 

CD74 antigen (invariant polypeptide of major histocompatibility 

complex; class II antigen-associated) 

Spliceosomal Protein Sap 62 

syntaxin 3A 

ESTs 

ESTs 

ESTs 

ESTs 

thymosin; beta 4; X chromosome 
ESTs 

ferritin; light polypeptide 
mitogen-activated protein kinase kinase 7 
ESTs 

Human HLA-DR alpha-chain mRNA

caldecrin (serum calcium decreasing factor; elastase IV)

cathepsin D (lysosomal aspartyl protease)

immunoglobulin lambda gene cluster 

Alpha-1-Antitrypsin, 5' End

ESTs 

ESTs 

ESTs; Weakly similar to stac [H.sapiens] 
U6 snRNA-associated Sm-like protein
ESTs 

v-ets avian erythroblastosis virus E26 oncogene related 

Major Histocompatibility Complex, Class II Beta W52

ATP synthase; H+transprtng; mitochndrl F1 complex; beta polypept 

transforming growth factor beta-stimulated protein TSC-22 

ESTs; Moderately similar to F25965_3 [H.sapiens] 

v-raf murine sarcoma 3611 viral oncogene homolog 1 

Serine/Threonine Kinase (Gb225424) 

matrix metalloproteinase 2 (gelatinase A; 72kD gelatinase;

72kD type IV collagenase) 

ESTs; Weakly similar to similar to SP:YR40_BACSU [C.elegans]
ESTs 

major histocompatibility complex; class II; DQ beta 1
Human ataxia-telangiectasia locus protein (ATM) gene, exons 
1a, 1b, 2, 3 and 4, partial cds 
tryptase; alpha 

RAP1; GTPase activating protein 1 



0.012 
0.015 
0.017 
0.017 
0.019 



0.024 
0.027 
0.027 
0.023 
0.028 
0.029 



0.033 
0.034 
0.034
102104
131340 
130446 
101352 



108129 
130511 



132982 
131880 
130540 
133457 
101191 
101860 
102799 



102919 
100574 
131286 
102675 

131332 
101634 
113118 
124884 



100817 
106746 
135401 
130479 



133132 
121811 
129613 



114427 
118321 
118979 
107495 



U12139 

AM78305 

X79510 

L77701 

AA453310 

R39552 

Z14093 



D20350 
L14927 
M54915 



X12447 

HG2279-HT2375 

AA450092 

U72512 

R50487 

M57731 

T47906 

R77276 

W76097 

H26742 

AA454980 

H09751 

AA291139 

HG4011-HT4804 

AM76436 

L14813 

R44163 

U62015 

AA412165 

AM25137 

AA342422 

AA282133 

133487 

X80200 



AA424535 

AA279481 

S79854 

W95841 

Z83741 

F10874 

C02170 

AA435146 

AA609878 

AA206465 

AA017063 

N79070 



Hs.25817 
Hs.155693 
Hs.16297 



Hs.1584 

Hs.71190 
Hs.198118 
Hs.33818 
Hs.159534 
Hs.73931 
Hs.83656 
Hs.37165 



Hs.81170 
Hs.98858 
Hs.151531 

Hs.183760

Hs.25300 



Hs.25717 

Hs.75765 

Hs.220512 

Hs.120911 

Hs.214507 

Hs.25367 

Hs.25601 

Hs.5038 

Hs.72242 



Hs.49322 
Hs.136031 
Hs.248174 



Human alpha1(XI) collagen (COL11A1) gene, 5' region and exon 1

Homo sapiens chromosome 19; cosmid R27216 

protein tyrosine phosphatase; non-receptor type 21 

COX17 (yeast) homolog; cytochrome c oxidase assembly protein 

alpha-methylacyl-CoA racemase 

Homo sapiens clone 23522 mRNA sequence 

branched chain keto acid dehydrogenase E1 ; alpha polypeptide 

(maple syrup urine disease) 

ESTs; Weakly similar to !!!! ALU SUBFAMILY J WARNING
ENTRY !!!! [H.sapiens]

cartilage oligomeric matrix protein (pseudoachondroplasia; 

epiphyseal dysplasia 1; multiple) 

ESTs 

immunoglobulin lambda-like polypeptides 
RecQ protein-like 5 

protein tyrosine phosphatase; receptor type; S 
major histocompatibility complex; class II; DQ beta 1 
Rho GDP dissociation inhibitor (GDI) beta 
collagen; type IX; alpha 2 

Human endogenous retroviral H protease/integrase-derived ORF1
mRNA, complete cds, and putative envelope prot mRNA, partial cds 
ESTs 

lipocalin 1 (protein migrating faster than albumin; tear prealbumin)

pim-1 oncogene 

ESTs 

protein phosphatase 3 (formerly 2B); catalytic subunit; beta isoform 
(calcineurin A beta)



0.041 
0.041 
0.042 
0.042 
0.042 
0.042 



0.043 
0.043 
0.044 
0.044 
0.044 



0.044 
0.044 
0.044 



0.044 
0.045 
0.045 

0.045 
0.045 
0.046 
0.046 
0.046 
0.046 



0.046 
0.047 
0.047 
0.047 
0.047 
0.047 
0.048 



Hs.256470 
Hs.94789 



Triosephosphate Isomerase 

Homo sapiens clones 24718 and 24825 mRNA sequence 
Human B-cell receptor associated protein (hBAP) alternatively 
spliced mRNA, partial 3'UTR 
ESTs 

GRO2 oncogene
ESTs 
ESTs 
ESTs 

ESTs; Weakly similar to ALR [H.sapiens] 
chromodomain helicase DNA binding protein 3 
neuropathy target esterase 
ESTs 

Dystrophin-Associated Glycoprotein, 50 Kda, Alt. Splice 2 
ESTs 

carboxyl ester lipase-like (bile salt-stimulated lipase-like) 
Homo sapiens clone 23770 mRNA sequence 
cysteine-rich; angiogenic inducer; 61 
EST 

Homo sapiens chromosome 19; cosmid R28379 

ESTs 0.048 

ESTs; Weakly similar to similar to collagen [C.elegans] 0.048 

estrogen-related receptor alpha 0.048 

TNF receptor-associated factor 4 0.048 

Potassium Channel Protein (Gb:Z11585) 0.048

ESTs; Weakly similar to dJ393P12.2 [H.sapiens] 0.048 

ESTs 0.048 

ESTs; Weakly similar to collagen alpha 1(XVIII) chain [M.musculus] 0.049 

deiodinase; iodothyronine; type III 0.049 

ESTs 0.049 

H2A histone family; member M 0.049 

mitogen-activated protein kinase 8 interacting protein 1 0.049 

ESTs; Weakly smlr to weak smlrity to ribosomal prot L14 [C.elegans] 0.049 

ESTs 0.05 
ESTs; Weakly smlr to 110 KD CELL MEMBRANE GLYCOPROTEIN [H.sapiens] 0.813

EST 0.05 

ESTs; Highly similar to Mlz-1 protein [H.sapiens] 0.05

ESTs 0.05 

protein tyrosine phosphatase type IVA; member 3 0.05 

ESTs 0.051

ESTs 0.051 






W81679 
AA482390 
R54798 
M17254 



130947 
129242 
131413 
112304
101416
131201
101054
101306
129311

129942
119210
101046

114086
110171
101004
129715
101581

113285



101419 M17886

119724 W69468

102673 U72509

129877 AA248589

114788 AA156737

123812 AA620807



K02405 
L41143 
T55087 



H19964 
J04101 
N58479 



123193
132595
104161
115330
112893

133475
128699
102940
131299
102495

129594



AA456471 

AA281145 

T08000 

L29217 

K03207 

X13956 

AA431464 

U51240 

R70379 

N69020 

U54602 



Z40782 

AA400378 

AM16947 

U05681 

AA479498 

X78706 

AA489010 

Z30644 

N59478 



110441 H50302

125297 Z39215 

135258 AA292423 

130633 T92363 

112006 R42607 



Hs.12400 
Hs.21506 
Hs.5174 
Hs.26510 
Hs.26239 
Hs.45514 
Hs.24174 
Hs.73933 



Hs.76893 
Hs.93678 
Hs.177592 
Hs.47622 



Hs.169401

Hs.136956 

Hs.155742 

Hs.7724 

Hs.88827 

Hs.194684

Hs.73987 

Hs.103972

Hs.24998 



Hs.31210
Hs.25274 
Hs.12068 
Hs.178400 
Hs.123059 
Hs.48396 



Hs.13104 
Hs.19845 
Hs.159409 
Hs.97272 
Hs.178703
Hs.22241 



ESTs 
ESTs 

ribosomal protein S17 

ESTs; Modly smlr to vacuolar prot sorting homolog r-vps33b [R.norvegicus] 
ESTs 

v-ets avian erythroblastosis virus E26 oncogene related
ESTs 

Human MHC class II HLA-DQ-beta mRNA (DR7 DQw2); complete cds 

T-cell leukemia translocation altered gene 

yb45c08.r1 Stratagene fetal spleen (#937205) Homo sapiens cONA 

clone IMAGE:74126 5', mRNA sequence.

phospholipase A2; group X

ESTs 

Accession not listed in Genbank 

Homo sapiens PAC clone DJ0777O23 from 7p14-p15 

ESTs 

v-ets avian erythroblastosis virus E26 oncogene homolog 1 

ESTs; Weakly similar to LR8 [H.sapiens] 

major histocompatibility complex; class II; DQ alpha 1 

ESTs 

ESTs 

Cpg-Enriched Dna, Clone S19

3-hydroxybutyrate dehydrogenase (heart; mitochondrial) 

ESTs 

ribosomal protein; large; P1 



a disintegrin and metalloproteinase domain 15 (metargidin) 



ESTs 

glyoxylate reductase/hydroxypyruvate reductase
KIAA0963 protein 
ESTs 

bassoon (presynaptic cytomatrix protein) 
CDC-like kinase 3 

proline-rich protein BstNI subfamily 4
Hu 12S RNA induced by poly(rI); poly(rC) and N<
ESTs; Weakly similar to unknown [H.sapiens]
Lysosomal-associated multispanning membrane protein-5 
Human germline IgD chain gene; C-region; C-delta-1 domain 
EST 

keratin 17 

sema domain; immunoglobulin domain (Ig); short basic domain; 

secreted; (semaphorin) 3E 

alpha-2-plasmin inhibitor

similar to S68401 (cattle) glucose induced gene 

ESTs 

ESTs; Highly similar to KIAA0612 protein [H.sapiens] 
B-cell CLL/lymphoma 3

ESTs; Modly smlr to putative seven pass transmembrane prot [H.sapiens] 

carnitine acetyltransferase 

ESTs 

chloride channel Kb 

ESTs; Moderately similar to tumor necrosis factor-alpha-induced protein B12 [H.sapiens]
growth differentiation factor 11

leukocyte immunoglobulin-like receptor; subfamily B (with TM

and ITIM domains); member 3 

EST 

ESTs; Highly smlr to prot phosphatase 2A BR gamma subunit [H.sapiens] 
ESTs 

ESTs; Weakly similar to dJ281H8.2 [H.sapiens]
ESTs 

hypothetical protein 



0.051 
0.052 
0.052 



0.053 
0.053 
0.053 



0.054 
0.054 
0.054 
0.054 
0.054 
0.055 



0.055 
0.055 
0.055 



0.055 
0.056 
0.056 
0.056 
0.056 
0.053 
0.056
0.056 



0.057 
0.057 
0.057 
0.057 
0.057 
0.057 
AA404565
N35489 

HG1872-HT1907 



121828 
133418 
129317 
130153 
124403 
127683 
129814 
131770 
117557 



105719 
124226 
117437 

132741 
134437 
107664 



103495 
129607 
106467 



103996 
110505 
133912 



N46244 
D85815 
N31745 
M668123 



Y10514 

W91960 

U15460 

M609183 

R46100 

V00563 



AA219179 

J04444 

L18983 

KD1396 

M621065 

H05704 

AA291644 

H62396 

N27645 

AA394133 

M26041 

M010594 

AA349417 

M34182 

C00476 

Y09022 

AM045S4 

AA450040 



AA297739 
AA321355 



Hs.1116 

Hs.17409 

Hs.83577 

Hs.32060 

Hs.58679 

Hs.98497 

Hs.172727

Hs.110373

Hs.15114 

Hs.102493 

Hs.134170 



Hs.181131 
Hs.9739 
Hs.179543
Hs.82520 
Hs.16725
Hs.105230 
Hs.72620 
Hs.111076
Hs.11081
Hs.97562 
Hs.6147 
Hs.110757
Hs.112471
Hs.19105 
Hs.697 
Hs.89655 
Hs.75621 
Hs.112943
Hs.110746
Hs.36793 
Hs.190266



Hs.198253

Hs.5326 

Hs.96917 

Hs.158029 

Hs.24395 

Hs.153591 

Hs.11607

Hs.154162 

Hs.106443



Hs.20495 
Hs.77522 
Hs.180255 



sodium channel; voltage-gated; type I; beta polypeptide 
KIAA0180 protein

ESTs; Moderately similar to kinesin light chain 1 [M.musculus] 
neurochondrin 

Major Histocompatibility Complex, Dg 
EST 

ESTs; Moderately similar to UV-1 protein [H.sapiens]
KIAA0906 protein 

lymphotoxin beta receptor (TNFR superfamily; member 3)

cysteine-rich protein 1 (intestinal)

cysteine and glycine-rich protein 3 (cardiac LIM protein) 

ESTs 

ESTs; Weakly similar to 4F2/CD98 light chain [M.musculus] 
ESTs 

Treacher Collins-Franceschetti syndrome 1 
ESTs 

ras homolog gene family; member D 



0.059 
0.059 
0.059 



H.sapiens mRNA for CD152 protein 0.06

sequence-specific single-stranded-DNA-binding protein 0.06 

activating transcription factor B 0.06 

ESTs 0.06

ESTs 0.061 

immunoglobulin mu 0.061 

Hu 1.1 kb mRNA upregulated in retinoic acid treated HL-60 neutrophilic cells 0.061

ESTs 0.061 

ESTs; Weakly similar to dJ963K232 [H.sapiens] 0.061 

DKFZP434I114 protein 0.061

malate dehydrogenase 2; NAD (mitochondrial) 0.061 

ESTs; Weakly similar to HPBRH-7 protein [H.sapiens] 0.061 

ESTs 0.061 

KIAA1075 protein 0.061 

DNA segment on chromosome 21 (unique) 2056 expressed sequence 0.061 

ESTs 0.061 

translocase of inner mitochondrial membrane 17 (yeast) homolog B 0.061

cytochrome c-1 0.062 

protein tyrosine phosphatase; receptor type; N 0.062 

protease inhibitor 1 (anti-elastase); alpha-1-antitrypsin 0.062

ESTs 0.052 

H sapiens HCR (a-helix coiled-coil rod homologue) mRNA; complete cds 0.062

ESTs 0.062 

ESTs 0.062
yw5e3.s1 Weizmann Olfactory Epithelium H sapiens cDNA clone 

IMAGE:255676 3' smlr to contains L1.t3 L1 repetitive element; mRNA seq 0.062

ESTs; Highly similar to OASIS protein [M.musculus] 0.062 

major histocompatibility complex; class II; DQ alpha 1 0.062 

ESTs; Moderately similar to pim-1 protein [H.sapiens] 0.062 

ESTs 0.062 

protein kinase; cAMP-dependent; catalytic; gamma 0.062 

small inducible cytokine subfamily B (Cys-X-Cys); member 14 (BRAK) 0.062 

Not56 (D. melanogaster)-like protein 0.062 

ESTs 0.062 

ADP-ribosylation factor-like 2 0.062 

ESTs 0.062 

Macrophage Scavenger Receptor, Alt. Splice 2 0.062 
ESTs; Weakly similar to !! ALU SUBFAMILY J WARNING ENTRY !! [H.sapiens] 0.062

ESTs 0.062 

sparc/osteonectin; cwcv and kazal-like domains proteoglycan (testican) 0.063 

H.sapiens isoform 1 gene for L-type calcium channel, exon 1 0.063 
ESTs; Weakly similar to ISOLEUCYL-TRNA SYNTHETASE; 

CYTOPLASMIC [H.sapiens] 0.063 

EST2393 Bone marrow Homo sapiens cDNA 5' end, mRNA sequence 0.063 

DKFZP434F011 protein 0.063 

major histocompatibility complex; class II; DM alpha 0.063 

major histocompatibility complex; class II; DR beta 1 0.063 
105595
100106 
134715 



132717 
128863 
115193 



AA521488 
X74794 
AA1024B9 
AA070473 



Y10141 
U40671 
AA417821 
AA457735 



N74597 

AA036794 

T10792 

AM06083 

T16275 



AF015910 
AA282757 
AA480109 



F02322 

T12559 

AA156597 

T10316 

AA256073 

AA278412 

M87789 

H03387 

H93721 

AM00138 

U12707 

U24183 



Hs.150922 

Hs.5307 

Hs.178009 



Hs.12373 
Hs.94517 
Hs.98910 
Hs.6455 



Hs.17719 
Hs.171014 
Hs.18858 



Hs.23466 
Hs.108268 
Hs.77628 
Hs.28355 
Hs.110453
Hs.180535 



Hs.89040 
Hs.9963 
Hs.251651 



Hs.76751 

Hs.821 

Hs.26135 

Hs.221382 

HS556441 

Hs.4302 

Hs.190626 

Hs.21346 

Hs.140 

Hs.241305 

Hs.20798 

Hs.97703 

Hs.2157 

Hs.75160 

Hs.8021 

Hs.82143 

Hs.50093 

Hs.9527 



Hs.77225 
Hs.151696 
Hs.106674 
Hs.88218 
Hs.141605 
Hs.42846 
Hs.32757 



BCS1 (yeast homolog)-like

synaptopodin 

ESTs 

transcription elongation factor A (SII); 2

site-1 protease (subtilisin-like; sterol-regulated; cleaves sterol regulatory

element binding proteins) 

ESTs 

KIAA0128 protein

minichromosome maintenance deficient (S. cerevisiae) 4 
ESTs 

zm7c8.s1 Stratagene neuroepithelium (#937231) Homo sapiens cDNA 
clone IMAGE:5399 3', mRNA sequence
KIAA0422 protein 

ESTs; Weakly similar to predicted using Genefinder [C.elegans] 
ESTs 

RuvB (E coli homolog)-like 2

ya94a02.s1 Stratagene placenta (#937225) Homo sapiens cDNA clone 

IMAGE:69290 3', mRNA sequence. 

ESTs 

VGF nerve growth factor inducible

phospholipase A2; group IVC (cytosolic; calcium-independent)

H.sapiens DAT1 gene, partial, VNTR 

ligase III; DNA; ATP-dependent

ESTs; Highly similar to CGI-69 protein [H.sapiens] 

IMP (inosine monophosphate) dehydrogenase 1

ESTs 

ESTs 

steroidogenic acute regulatory protein related 
ESTs 

ESTs; Highly similar to CGI-38 protein [H.sapiens] 

ESTs; Weakly similar to mitogen inducible gene mig-2 [H.sapiens] 

ESTs; Weakly similar to T20B12.3 [C.elegans] 

ESTs 

ESTs 

ESTs 

ESTs 

Homo sapiens unknown protein mRNA, partial cds 
prepronociceptin 

TYRO protein tyrosine kinase binding protein 
EST 

dimethylarginine dimethylaminohydrolase 2

Human gamma-aminobutyric acid transaminase mRNA, partial cds
biglycan 
ESTs 
ESTs 

EST; Moderately similar to CGI-136 protein [H.sapiens] 

ESTs 

ESTs 

ESTs; Weakly similar to F42C5.7 gene product [C.elegans] 

immunoglobulin gamma 3 (Gm marker) 

estrogen-responsive B box protein 

ESTs 

ESTs 

Wiskott-Aldrich syndrome (eczema-thrombocytopenia)



0.064 
0.064 
0.064 



0.064
0.064 
0.064 
0.064 



KIAA1058 protein 
ribosomal protein S12
ESTs 

ESTs; Highly similar to HSPC013 [H.sapiens] 

Human amiloride-sensitive epithelial sodium channel gamma subunit mRNA, 

5' end, partial cds 

ADP-ribosyltransferase (NAD+; poly (ADP-ribose) polymerase)-like 1 
DKFZP727G051 protein 

BRCA1 associated protein-1 (ubiquitin carboxy-terminal hydrolase) 

ESTs 

ESTs 

ESTs 

ESTs 
0.066
0.066 
0.066 



0.067 
0.067 
0.067 
0.067 
0.067 
0.067 



0.067 
0.067 
0.067 
0.067 
0.067 
0.067 
0.067 
U70663
AA348910 
R72079 
AM8S869 



100756 
113483
101119 



119147 
104900 

133185
115496 
121005 
124869 
129154 

112161
125251 
134298 

119745 
131306
107776 
134271 
101798 



118742 



112270 
116704 
129890 
127345 
112436 
114531 
135122 



112221 
100478 
115598 



N23673 
R48295 
W87486 
J00116 

W70264 

AA232686 

AA018820 



N74052 
N64656 
X12517 

HG371-HT26388 

N54361 

X80822 



H99080 

AA281338 

AA215369 



M402624 
AA432121 
H47542 
AB002344 



HG1067-HT1067 

AM00129 

AA227137 



Kruppel-like factor 4 (gut)

DOM-3 (C. elegans) homolog Z 

CD79B antigen (immunoglobulin-associated beta) 

ESTs 

Proto-Oncogene C-Myc, Alt. Splice 3, Orf 114

ye20f05.s1 Stratagene lung (#937210) H sapiens cDNA clone IMAGE: 

3' similar to contains Alu repetitive element;contains MER12 repetitive element; 

mRNA sequence. 

Zinc Finger Protein (Gb:M88357) 

ESTs 

complement component 2 
interleukin 15 receptor; alpha 
collagen-binding protein 2 (colligen 2) 
plasminogen activator inhibitor; type I 
ESTs; Weakly similar to T25G3.1 [C.elegans] 
ESTs 

CD14 antigen 
ESTs 

ESTs; Weakly similar to ACROSIN PRECURSOR [H.sapiens]



jn factor 4E binding protein 1 
ESTs 

ESTs; Weakly similar to F55A12.9 [C.elegans]
mannosidase; alpha; class 2B; member 1 

ESTs; Wkly smlr to !! ALU SUBFAMILY J WARNING ENTRY !! [H.sapiens]
ESTs 

collagen; type II; alpha 1 (primary osteoarthritis; spondyloepiphyseal 

dysplasia; congenital) 

ESTs 

ESTs 



EST 

Homo sapiens clone 24940 mRNA sequence 
small nuclear ribonucleoprotein polypeptide C 
Mucin 1, Epithelial, Alt. Splice 9 
ESTs 

ribosomal protein L18a 
ESTs 

Homo sapiens clone 24432 mRNA sequence 

ESTs; Weakly similar to WASP-family protein [H.sapiens]

Human L2-9 transcript of prearranged immunoglobulin V(H)5 pseudogene

calnexin 

ESTs 

EST 

ceruloplasmin (ferroxidase) 

ESTs; Highly similar to KIAA0476 protein [H.sapiens] 

ESTs 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA DKFZp564C186 (from clone DKFZp564C186) 
ESTs; Weakly similar to hypothetical protein [H.sapiens] 



synuclein; gamma (breast cancer-specific protein 1) 



KIAA0346 protein 
ESTs 

Mucin (Gb:M22406) 
ESTs 

KIAA0828 protein 

Human alpha-1 collagen type II gene, exons 1, 2 and 3
ESTs 

DKFZP434B103 protein 
KIAA0211 gene product
ESTs 




0.071 
0.071 
0.071 
0.071 



0.071 
0.071 
0.071 
0.072 
0.072 
0.072 
0.072 
0.072 
0.072 
0.072 
0.072 
0.072 
0.072 
0.072 
0.072 
0.072 
0.072 
0.072 
0.072 
0.072 
0.072 
0.073 
0.073 
0.073 
0.073 
0.073 
0.073 
0.073 
0.073 
0.073 



